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20 FIELD OF THE INVENTION 

The present invention relates to receptor 
proteins, DNA sequences encoding same, and various uses 
therefor. 

25 

BACKGROUND OF THE INVENTION 

Activins are dimeric proteins which have the 
ability to stimulate the production of follicle stimulating 
30 hormone (FSH) by the pituitary gland. Activins share a 
common subunit with inhibine, which inhibit FSH secretion, 

Activins are members of a superfamily of 
polypeptide growth factors which includes the inhibins, the 
35 transforming growth factors-/? (TGF-jff) , Mullerian duct 



-2- 



inhibiting substance, the Drosophila decapentaplegic 
peptide, several bone morphogenetic proteins, and the 
Vg-related peptides. 

5 As a result of their extensive anatomical 

distribution and multiple biological actions, members of 
this superfamily of polypeptide growth factors are believed 
to be involved in the regulation of numerous biological 
processes. Activin, for example, is involved in the 
10 proliferation of many tumor cell lines, the control of 
secretion and expression of the anterior pituitary hormones 
(e.g., FSH, GH and ACTH) , neuron survival, hypothalamic 
oxytocin secretion, erythropoiesis, placental and gonadal 
steroidogenesis, early embryonic development, and the like. 

15 

Other members of the activin/TGF-jS superfamily of 
polypeptide growth factors are involved in the regulation 
of cell function and cell proliferation for numerous cell 
types, in adults and embryos. For example, cells which are 

20 subject to regulation by one or more members of the 
activin/TGF-j3 superfamily of polypeptide growth factors 
include mesenchymal cells, muscle cells, skeletal cells, 
immune cells, hematopoietic cells, steroidogenic cells, 
endothelial cells, liver cells, epithelial cells, and the 

25 like. 

Chemical cross -linking studies with a number of 
cell types suggests that multiple binding sites (i.e., 
receptors) exist on the surface of cells. However, little 
3 0 is known about the structure of these receptors, or about 
the second messenger signalling systems that they employ. 
It would be desirable, therefore, if the nature of these 
poorly characterized receptor proteins could be more fully 
understood . 

35 
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BRIEF DESCRIPTION OF THE INVENTION 



In accordance with the present invention, we have 
identified and characterized members of a new superfamily 
5 of receptor proteins which comprise three distinct domains: 
an extracellular, ligand-binding domain, a hydrophobic, 
trans -membrane domain, and an intracellular, receptor 
domain having serine kinase- like activity. 

10 Also provided are DNAs encoding the above - 

described receptor proteins, and antibodies thereto, as 
well as bioassays, therapeutic compositions containing such 
proteins and/or antibodies, and applications thereof. 

15 The DNAs of the invention are useful as probes 

for the identification of additional members of the 
invention superfamily of receptor proteins, and as coding 
sequences which can be used for the recombinant expression 
of the invention receptor proteins, or functional fragments 

20 thereof. The invention receptor proteins, and antibodies 
thereto, are useful for the diagnosis and therapeutic 
management of carcinogenesis , wound healing, disorders of 
the immune, reproductive, or central nervous systems, and 
the like. 

25 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a schematic diagram of receptors of 
the invention and the various domains thereof. 

30 

Figure 2 outlines the strategy used for 
expression cloning of a receptor of the activin/TGF-/3 
receptor superfamily. 

35 Figure 3 is a schematic of two mouse activin 

receptor clones. The top line of the figure is a 
restriction map, in kb, of mActRl and mActR2, with 
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numbering starting from bp 1 of mActR2 . The dotted line in 
the figure represents 5' untranslated sequences present 
only in mActRl. The middle lines present a schematic 
representation of two activin receptor cDNA clones* Boxes 

5 represent coding sequences black is the signal peptide, 

white is the extracellular ligand-binding domain, gray is 
the transmembrane, and the intracellular kinase domain is 
hatched. Amino acids are numbered beneath the schematics. 

10 Figure 4 presents a comparison between activin 

receptor and daf-1 [a C. elegans gene encoding a putative 
receptor protein kinase {with unknown ligand) ; see Georgi, 
et al., Cell 61: 635-645 (1990)]. Conserved residues 
between the activin receptor and daf-1 are highlighted; 

15 conserved kinase domain residues are designated with an 

Figure 5A summarizes results of 125 I activin A 
binding to COS cells transfected with pmActRl. Binding was 

20 competed with unlabeled activin A. For the runs reported 
herein, total binding was 4.6% of input cpm, non-specific 
binding was 0.9% of input cpm, and therefore the specific 
binding was 3.7% of input cpm. Data are shown as % 
specific binding, normalized to 100%. The inset presents 

25 a Scatchard analysis of the data [Ann. NY Acad. Sci. 51: 
660-672 (1979)] . 

Figure 5B summarizes results of 125 I activin A 
binding to COS cells transfected with pmActR2 . Binding was 
30 competed with unlabeled factors as indicated in the figure. 
For the runs reported herein, total binding was 3.4% of 
input cpm, non-specific binding was 0.9% of input cpm, and 
therefore the specific binding was 2.5% of input cpm. Data 
are shown as % specific binding, normalized to 100%. 

35 

Figure 6 is a phylogenetic tree, comparing the 
relationship of the activin receptor kinase domain to other 
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protein kinases. To construct the tree, the catalytic 
domains of representative sequences were empirically- 
aligned and evolutionary relatedness was calculated using 
an algorithm designed by Fitch and Margoliash [Science 155 : 
5 279-284 (1967)], as implemented by Feng and Doolittle [J . 
Mol. Evol. 25.: 351-360 (1987)]. Known subfamilies of 
kinases are indicated in the figure. For those sequences 
that had similarity scores (i,e., a relative sequence 
identity) of at least 4 standard deviations above the mean 
10 (in comparison with all other known kinase sequences) , the 
percent identity with the activin receptor is indicated. 
For further detail on kinase sequences, the reader is 
referred to Hanks and Quinn, Meth. Enzymol. 200 : 38-62 
(1991) . 

15 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there 
is provided a novel superfamily of receptor protein (s) 
2 0 characterized by having the following domains, reading from 
the N- terminal end of said protein: 

an extracellular, ligand-binding domain, 
a hydrophobic, trans -membrane domain, and 
an intracellular domain having serine kinase -like 
25 activity. 

The novel receptor protein (s) of the invention 
optionally further comprise a second hydrophobic domain at 
the amino terminus thereof. 

30 

As employed herein, the phrase "extracellular, 
ligand-binding domain" refers to that portion of receptors 
of the invention which has a high affinity for ligand, and 
which, when associated with a cell, resides primarily 
35 outside of the cell membrane. Because of its location, 
this domain is not exposed to the processing machinery 
present within the cell, but is exposed to all components 
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of the extracellular medium. See Figure 1. 

As employed herein, the phrase "hydrophobic, 
trans -membrane domain" refers to that portion of receptors 
5 of the invention which traverses the cell membrane, and 
serves as a "bridge" between the extracellular and 
intracellular domains of the receptor. The hydrophobic 
nature of this domain serves to anchor the receptor to the 
cell membrane. See Figure 1. 

10 

As employed herein, the phrase "intracellular 
domain having serine kinase -like activity" refers to that 
portion of receptors of the invention which resides within 
the cytoplasm, and which embodies the catalytic 
15 functionality characteristic of all receptors of the 
invention. See Fig 1. 

The optional second hydrophobic domain, 
positioned at the amino terminus of receptors of the 
20 invention, comprises a secretion signal sequence which 
promotes the intracellular transport of the initially 
expressed receptor protein across the Golgi membrane . See 
Figure 1 . 

25 Members of the invention superf amily of receptors 

can be further characterized as having sufficient binding 
affinity for at least one member of the activin/TGF-/3 
super family of polypeptide growth factors such that 
concentrations of 5 10 nM of said polypeptide growth factor 

30 occupy a 50% of the binding sites of said receptor protein. 

Binding affinity (which can be expressed in terms 
of association constants, Ka, or dissociation constants, 
Kd) refers to the strength of interaction between ligand 
35 and receptor, and can be expressed in terms of the 
concentration of ligand necessary to occupy one-half (50%) 
of the binding sites of the receptor. A receptor having a 
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high binding affinity for a given ligand will require the 
presence of very little ligand to become at least 50% bound 
(hence the Kd value will be a small number) ; conversely, 
receptor having a low binding affinity for a given ligand 
5 will require the presence of high levels of ligand to 
become 50% bound (hence the Kd value will be a large 
number) - 

Reference to receptor protein "having sufficient 
10 binding affinity such that concentrations of said 
polypeptide growth factor less than or equal to 10 nM 
(i.e., <; 10 nM) occupy a 50% (i.e., greater than or equal 
to one-half) of the binding sites of said receptor protein" 
means that ligand (i.e-, polypeptide growth factor) 
15 concentration (s) of no greater than about 10 nM are 
required in order for the ligand to occupy at least 50% of 
the active sites of said receptor (preferably about 0.1-1.0 
nM of said receptor) , with much lower ligand concentrations 
typically being required. Presently preferred receptors of 
20 the present invention have a binding affinity such that 
ligand concentration (s) in the range of only about 100 - 
500 pM are required in order to occupy (or bind to) at 
least 50% of the receptor binding sites, wherein the 
receptor concentration is preferably about 0.1-1.0 nM. 

25 

Members of the invention superfamily of receptors 
can be divided into various subclasses, based on the 
approximate size of the crossl inked complexes obtained when 
radiolabeled activin is chemically crossl inked to cell 

30 extracts [see, for example, Example VI below, or Mathews 
and Vale in Cell 65:973-982 (1991)]. Type I activin/TGF-jS 
receptors are those which form a crossl inked complex of 
about 65 kD with activin; Type II receptors are those which 
form a crosslinked complex of about 80-85 kD with activin; 

35 while Type III, Type IV and the like receptors are those 
which form crosslinked complexes with activin having 
molecular weights greater than about 100 kD. 
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Each member of a given subclass is related to 
other members of the same subclass by the high degree of 
homology (e.g., >80% overall amino acid homology; 
frequently having >90% overall amino acid homology) between 
5 such receptors; whereas members of a given subclass differ 
from members of a different subclass by the lower degree of 
homology (e.g., at least about 30% up to 80% overall amino 
acid homology; with in the range of about 40% up to 90% 
amino acid homology specifically in the kinase domains 

10 thereof) between such receptors. Typically, related 
receptors have at least 50% overall amino acid homology; 
with at least about 60% amino acid homology in the kinase 
domains thereof. Preferably, related receptors are defined 
as those which have at least 60% overall amino acid 

15 homology; with at least about 70% amino acid homology in 
the kinase domains thereof . 

Based on the above criteria, the receptors 
described herein are designated Type II receptors, with the 

20 first discovered Type II receptor (i.e., the mouse-derived 
activin receptor) being designated ActRII, while 
subsequently identified Type II receptors which are not 
homologs of ActRII (because while clearly related by size 
and some sequence homology, they differ sufficiently to be 

25 considered as variants of ActRII) , are designated ActRIIB, 
ActRIIC, etc. 

Presently preferred members of the invention 
superfamily of receptors are further characterized by 
30 having a greater binding affinity for activins than for 
inhibins . Such receptors are frequently also observed to 
have ; 

substantially no binding affinity for transforming 
growth factors-/?, and 
35 substantially no binding affinity for non-activin-like 

proteins or compounds. 
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Additional members of the invention superfamily 
of receptors are further characterized by having a greater 
binding affinity for inhibins than for activins or TGF-/3s. 

5 Additional members of the invention superfamily 

of receptors are further characterized by having a greater 
binding affinity for TGF-/?s than for activins or inhibins. 

As employed herein, "activin" refers to activin 
10 A (a homodimer of two inhibin 0 A subunits) , activin B (a 
homodimer of two inhibin j8 B subunits) , activin AB (a 
heterodimer composed of one inhibin /3 A subunit and one 
inhibin 0 B subunit) ; "inhibin" refers to inhibin A (composed 
of the inhibin a subunit and an inhibin {3 A subunit) , inhibin 
15 B (composed of the inhibin a subunit and an inhibin f3 B 
subunit) ; "transforming growth factor /3 or TGF-jS" refers to 
TGF-jSl (a homodimer of two TGF-/31 subunits) , TGF-JS2 (a 
homodimer of two TGF-/J2 subunits) , TGF-J83 (a homodimer of 
two TGF-/33 subunits) , TGF-/34 (a homodimer of two TGF-/34 
20 subunits) , TGF-/85 (a homodimer of two TGF-/35 subunits) , 
TGF-/81.2 {a heterodimer of one TGF-jSl subunit and one 
TGF-/J2 subunit), and the like. 

Transforming growth factors -0 (TGF-/?s) are 
25 members of the activin/TGF-# superfamily of polypeptide 
growth factors. TGF-jSs are structurally related to 
activins, sharing at least 20-30% amino acid sequence 
homology therewith. TGF-fis and activins have a 

substantially similar distribution pattern of cysteine 
30 residues (or substitution) throughout the peptide chain. 
Furthermore, both polypeptides, in their active forms, are 
dimeric species. 

As employed herein, the term "non-activin-like" 
35 proteins refers to any protein having essentially no 
structural similarity with activins (as defined broadly 
herein) . 
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Pref erred members of the invention superfamily of 
receptors comprise those having in the range of about 500 
amino acids , and are further characterized by having the 
following designated sizes for each of the domains thereof, 
5 reading from the N- terminal end of said receptor: 

the extracellular, ligand-binding domain 
preferably will have in the range of about 88-118 
amino acids, 

the hydrophobic, trans -membrane domain preferably 
10 will have in the range of about 23-28 amino acids, 

beginning at the carboxy terminus of the extracellular 
domain, and 

the intracellular domain having kinase- like 
activity preferably will have in the range of about 
15 345-360 amino acids, beginning at the carboxy terminus 

of the hydrophobic, trans -membrane domain. 

Receptors of the invention optionally further 
comprise a second hydrophobic domain having in the range of 

20 about 16-3 0 amino acids at the extreme amino terminus 
thereof (i.e., at the amino terminus of the extracellular, 
ligand-binding domain) . This domain is a secretion signal 
sequence, which aids the transport of invention receptor (s) 
across the cell membrane. Exemplary secretion signal 

25 sequences include amino acids 1-19 of Sequence ID No. 1, 
amino acids 1-20 of Sequence ID No. 3, amino acids 1-25 of 
Sequence ID No. 11 / and the like. Such secretion signal 
sequences can be encoded by such nucleic acid sequences as 
nucleotides 71-127 of Sequence ID No. 1, nucleotides 468- 

30 527 of Sequence ID No. 3, nucleotides 72-146 of Sequence ID 
No. ll # and the like. 

Members of the invention superfamily of receptors 
can be obtained from a variety of sources, such as, for 
35 example, pituitary cells, placental cells, hematopoietic 
cells, brain cells, gonadal cells, liver cells, bone cells, 
muscle cells, endothelial cells, epithelial cells, 
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mesenchymal cells, kidney cells, and the like. Such cells 
can be derived from a variety of organisms, such as, for 
example, human, mouse, rat, ovine, bovine, porcine, frog, 
chicken, fish, mink, and the like* 

5 

Presently preferred amino acid sequences encoding 
receptor proteins of the invention include the sequence set 
forth in Sequence ID No, 2 (which represents a mouse 
activin receptor amino acid sequence) , a modified form of 

10 Sequence ID No, 2 wherein the arginine at residue number 39 
is replaced by a lysine, the isoleucine at residue number 
92 is replaced by a valine, and the glutamic acid at 
residue number 288 is replaced by a glutamine {which 
modified form of Sequence ID No. 1 is referred to 

15 hereinafter as "Sequence ID No. 1' " , and represents a human 
activin receptor amino acid sequence) , the sequence set 
forth as Sequence ID No. 4 (which represents a Xenopus 
activin receptor amino acid sequence) , and Sequence ID No. 
12 (which represents a rat activin receptor- like kinase 

20 amino acid sequence) as well as functional, modified forms 
thereof. Those of skill in the art recognize that numerous 
residues of the above -described sequences can be 
substituted with other, chemically, sterically and/or 
electronically similar residues without substantially 

25 altering the biological activity of the resulting receptor 
species . 

In accordance with another embodiment of the 
present invention, there is provided a soluble, 
30 extracellular, ligand-binding protein, further 
characterized by: 

having sufficient binding affinity for at least one 
member of the activin/TGF-jS superfamily of polypeptide 
growth factors such that concentrations of s 10 nM of said 
35 polypeptide growth factor occupy a 50% of the binding sites 
on said receptor protein, and 

having at least about 30% sequence identity with 
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respect to: 

the sequence of amino acids 20-134 set forth in 
Sequence ID No . 2 ; 

the sequence of amino acids 20-134 set forth in 
Sequence ID No. 2, wherein the arginine residue at 
position number 3 9 is replaced by a lysine, and the 
isoleucine at residue number 92 is replaced by a 
valine; 

the sequence of amino acids 21-132 set forth in 
Sequence ID No . 4 ; or 

the sequence of amino acids 26-113 set forth in 
Sequence ID No. 12. 

Presently preferred soluble, extracellular, 
ligand-binding proteins contemplated by the present 
invention can be further characterized by having at least 
about 50% sequence identity with respect to: 

the sequence of amino acids 20-134 set forth in 
Sequence ID No. 2; 

the sequence of amino acids 20-134 set forth in 
Sequence ID No. 2, wherein the arginine residue at 
position number 39 is replaced by a lysine, and the 
isoleucine at residue number 92 is replaced by a 
valine; 

the sequence of amino acids 21-132 set forth in 
Sequence ID No. 4; or 

the sequence of amino acids 26-113 set forth in 
Sequence ID No. 12; 
with the presently most preferred soluble, extracellular, 
ligand-binding proteins having at least about 80% sequence 
identity with respect to the above -referenced fragments of 
Sequence ID Nos. 2, 4 or 12 . 

Members of the class of soluble, ligand-binding 
proteins contemplated by the present invention may be 
divided into various subclasses, as previously described, 
wherein members of one subclass may have a greater binding 
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affinity for activins than for inhibins and/or TGF-/3S; or 
alternatively, members of another subclass may have a 
greater binding affinity for inhibins than for activins 
and/or TGF-/3s; or alternatively, members of yet another 
5 subclass may have a greater binding affinity for TGF-jSs 
than for activins and/or inhibins* It is, of course, 
understood by those of skill in the art, that members of 
more than one subclass may have a greater binding affinity 
for one member of the activin/TGF-/? superfamily of 
10 polypeptide growth factors, relative to other members of 
the superfamily. 

Presently preferred soluble, extracellular, 
ligand-binding proteins of the present invention are 
15 further characterized by: 

having a greater binding affinity for activins than 
for inhibins, 

having substantially no binding affinity for 
transforming growth factors-jS, and 
20 having substantially no binding affinity for 

non-activin-like proteins . 

Presently preferred soluble, extracellular, 
ligand-binding proteins of the present invention typically 
25 comprise in the range of about 88-118 amino acids. 

Especially preferred soluble, extracellular, 
ligand-binding proteins of the invention are those having 
substantially the same amino acid sequence as that set 
3 0 forth as: 

residues 20-134 of Sequence ID No. 2; 
residues 20-134 of Sequence ID No. 2, wherein the 
arginine residue at position number 3 9 is replaced by 
a lysine, and the isoleucine at residue number 92 is 
35 replaced by a valine; 

residues 21-132 of Sequence ID No. 4; or 
residues 26-113 of Sequence ID No. 12. 
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As employed herein, the term "substantially the 
same amino acid sequence" refers to amino acid sequences 
having at least about 80% identity with respect to the 
reference amino acid sequence , and will retain comparable 
5 functional and biological properties characteristic of the 
protein encoded by the reference amino acid. Preferably, 
proteins having "substantially the same amino acid 
sequence" will have at least about 90% amino acid identity 
with respect to the reference amino acid sequence; with 
10 greater than about 95% amino acid sequence identity being 
especially preferred. 

The above -described soluble proteins can be 
employed for a variety of therapeutic uses, e.g., to block 

15 receptors of the invention from affecting processes which 
the receptors would otherwise mediate . The presence of the 
soluble proteins of the invention will compete with 
functional ligand for the receptor, preventing the 
formation of a functional receptor-ligand complex, thereby 

20 blocking the normal regulatory action of the complex. 

In accordance with yet another embodiment of the 
present invention, there are provided antibodies generated 
against the above -described soluble proteins and receptor 
25 proteins. Such antibodies can be employed for diagnostic 
applications, therapeutic applications, and the like. 
Preferably, for therapeutic applications, the antibodies 
employed will be monoclonal antibodies. 

30 The above-described antibodies can be prepared 

employing standard techniques, as are well known to those 
of skill in the art, using the invention receptor proteins 
as antigens for antibody production. 

35 In accordance with still another embodiment of 

the present invention, there are provided methods for 
modulating the transcription trans -activation of 
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receptor(s) of the invention by contacting said receptor (s) 
with a modulating, effective amount of the above -described 
antibodies. 

5 The soluble proteins of the invention, and the 

antibodies of the invention, can be administered to a 
subject employing standard methods, such as, for example, 
by intraperitoneal, intramuscular, intravenous, or 
subcutaneous injection, implant or transdermal modes of 
10 administration, and the like. In addition, methods such as 
transfection with viral or retroviral vectors encoding the 
invention compositions . One of skill in the art can 
readily determine dose forms, treatment regiments, etc, 
depending on the mode of administration employed. 

15 

In accordance with a further embodiment of the 
present invention, there are provided DNA sequences which 
encode the above-described soluble proteins and receptor 
proteins. Optionally, such DNA sequences, or fragments 
20 thereof, can be labeled with a readily detectable 
substituent (to be used, for example, as a hybridization 
probe) . 

The above -de scribed receptor (s) can be encoded by 
25 numerous DNA sequences, e.g., a DNA sequence having a 
contiguous nucleotide sequence substantially the same as: 
nucleotides 128 - 1609 of Sequence ID No. 1 
(which encodes a mouse activin receptor) ; 

variations of nucleotides 128 - 1609 of Sequence 
30 ID No. 1, wherein the codon for residue number 39 of 

the encoded amino acid codes for lysine, the codon for 
residue number 92 of the encoded amino acid codes for 
valine, and the codon for residue number 288 of the 
encoded amino acid encodes glutamine (which encodes a 
35 human activin receptor) ; 

nucleotides 528 - 1997 of Sequence ID No. 3 
(which encodes a Xenopus activin receptor) ; 
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nucleotides 147 - 1550 of Sequence ID No. 11 
(which encodes a rat activin receptor) ; or 

variations of any of the above sequences which 
encode the same amino acid sequences, but employ 
5 different codons for some of the amino acids. 

As employed herein, the term "substantially the 
same as" refers to DNA having at least about 70% homology 
with respect to the nucleotide sequence of the DNA fragment 
10 with which subject DNA is being compared. Preferably , DNA 
"substantially the same as" a comparative DNA will be at 
least about 80% homologous to the comparative nucleotide 
sequence; with greater than about 90% homology being 
especially preferred. 

15 

Another DNA which encodes a receptor of the 
invention is one having a contiguous nucleotide sequence 
substantially the same as: 

nucleotides 71 - 1609 of Sequence ID No* 1 (which 
20 encodes a precursor- form of a mouse activin receptor) ; 

variations of nucleotides 71 - 1609 of Sequence 
ID No. 1, wherein the codon for residue number 39 of 
the encoded amino acid codes for lysine, the codon for 
residue number 92 of the encoded amino acid codes for 
25 valine, and the codon for residue number 288 of the 

encoded amino acid encodes glutamine (which encodes a 
precursor- form of a human activin receptor) ; 

nucleotides 468 - 1997 of Sequence ID No. 3 
(which encodes a precursor form of a Xenopus activin 
30 receptor) ; 

nucleotides 72 - 1550 of Sequence ID No. 11 
(which encodes a precursor form of a rat activin 
receptor) ; or 

variations of any of the above sequences which 
35 encode the same amino acid sequences, but employ 

different codons for some of the amino acids. 
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Yet another DNA which encodes the above -described 
receptor is one having a contiguous nucleotide sequence 
substantially the same as set forth in Sequence ID No. 1, 
Sequence ID No. 1', Sequence ID No. 3, or Sequence ID No. 
5 11. 

In accordance with a further embodiment of the 
present invention, the receptor-encoding cDNAs can be 
employed to probe library(ies) (e.g., cDNA, genomic, and 

10 the like) for additional sequences encoding novel receptors 
of the activin/TGP-j8 superfamily. Such screening is 
initially carried out under low- stringency conditions, 
which comprise a temperature of less than about 42°C, a 
formamide concentration of less than about 50%, and a 

15 moderate to low salt concentration. Presently preferred 
conditions for such screening comprise a temperature of 
about 37°C, a formamide concentration of about 20%, and a 
salt concentration of about 5X standard valine citrate 
(SSC; 20X SSC contains 3M sodium chloride, 0.3M sodium 

20 citrate, pH 7.0). Such conditions will allow the 
identification of sequences which have a substantial degree 
of similarity with the probe sequence, without requiring 
perfect homology for the identification of a stable hybrid. 
The phrase "substantial similarity" refers to sequences 

25 which share at least 50% homology. Preferably, 
hybridization conditions will be selected which allow the 
identification of sequences having at least 70% homology 
with the probe, while discriminating against sequences 
which have a lower degree of homology with the probe. 

30 

In accordance with yet another embodiment of the 
present invention, there is provided a method for the 
recombinant production of receptor (s) of the invention by 
expressing the above -de scribed DNA sequences in suitable 
35 host cells. 

The use of a wide variety of recombinant 
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organisms has been described for the production of 
peptides. One of skill in the art can readily determine 
suitable hosts (and expression conditions) for use in the 
recombinant production of the peptides of the present 
5 invention. Yeast hosts, bacterial hosts, mammalian hosts, 
and the like can be employed. Regulatory sequences capable 
of controlling the expresseion of invention peptides are 
well known for each of these host systems, as are growth 
conditions under which expression occurs. 

10 

In accordance with a further embodiment of the 
present invention, there is provided a binding assay 
employing receptors of the invention, whereby a large 
number of compounds can be rapidly screened to determine 
15 which compounds, if any, are capable of binding to the 
receptors of the invention. Then, more detailed assays can 
be carried out with those compounds found to bind, to 
further determine whether such compounds act as agonists or 
antagonists of invention receptors. 

20 

Another application of the binding assay of the 
invention is the assay of test samples (e.g., biological 
fluids) for the presence or absence of members of the 
activin/TGF-/8 superf amily of polypeptide growth factors. 

25 Thus, for example, serum from a patient displaying symptoms 
related to pathway (s) mediated by members of the 
activin/TGF-jS superfamily of polypeptide growth factors can 
be assayed to determine if the observed symptoms are 
perhaps caused by over- or under-production of such 

30 polypeptide growth factor. 

The binding assays contemplated by the present 
invention can be carried out in a variety of ways, as can 
readily be identified by one of skill in the art. For 
35 example, competitive binding assays can be employed, as 
well as radioimmunoassays, ELISA, ERMA, and the like. 
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In accordance with a still further embodiment of 
the present invention there are provided bioassays for 
evaluating whether test compounds are capable of acting as 
agonists or antagonists of receptor (s) of the present 
5 invention. 



The bioassays of the present invention involve 
evaluating whether test compounds are capable of acting as 
either agonists or antagonists for members of the invention 
10 superfamily of receptors, or functional modified forms of 
said receptor protein {s) . The bioassay for evaluating 
whether test compounds are capable of acting as agonists 
comprises: 

(a) culturing cells containing: 
15 DNA which expresses said receptor 

protein (s) or functional modified forms of 
said receptor protein (s), and 

DNA encoding a hormone response element 
operatively linked to a reporter gene; 
20 wherein said culturing is carried out in the 

presence of at least one compound whose ability 
to induce transcription activation activity of 
receptor protein is sought to be determined, and 
thereafter 

25 (b) monitoring said cells for expression of the 

product of said reporter gene. 



The bioassay for evaluating whether test 
compounds are capable of acting as antagonists for 
30 receptor (s) of the invention, or functional modified forms 
of said receptor(s), comprises: 

(a) culturing cells containing: 

DNA which expresses said receptor 
protein (s) , or functional modified forms of 
35 said receptor protein (s) , and 

DNA encoding a hormone response element 
operatively linked to a reporter gene 
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wherein said culturing is carried out in the 
presence of : 

increasing concentrations of at least 
one compound whose ability to inhibit 
5 transcription activation of said receptor 

protein (s) is sought to be determined, and 

a fixed concentration of at least one 
agonist for said receptor protein (s) , or 
functional modified forms of said receptor 
10 protein(s); and thereafter 

(b) monitoring in said cells the level of expression 
of the product of said reporter gene as a 
function of the concentration of said compound, 
thereby indicating the ability of said compound 
15 to inhibit activation of transcription. 

Host cells contemplated for use in the 
bioassay(s) of the present invention, include CV-1 cells, 
COS cells, and the like; reporter and expression plasmids 
20 employed typically also contain the origin of replication 
of SV-40; and the reporter and expression plasmids employed 
also typically contain a selectable marker. 

The hormone response element employed in the 
25 bioassay(s) of the present invention can be selected from, 
for example, mouse mammary tumor virus long terminal repeat 
(MTV LTR) , mammalian growth hormone promoter, and the 
reporter gene can be selected from chloramphenicol 
acetytransf erase (CAT) , lucif erase, j8-galactosidase, and 
30 the like. 

The cells can be monitored for the level of 
expression of the reporter gene in a variety of ways, such 
as, for example, by photometric means [e.g., by colorimetry 
35 (with a colored reporter product such as /?-galactosidase) , 
by fluorescence (with a reporter product such as 
lucif erase) , etc], by enzyme activity, and the like. 
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Compounds contemplated for screening in 
accordance with the invention bioassays include activin- or 
TGF-/J-like compounds , as well as compounds which bear no 
particular structural or biological relatedness to activin 
5 or TGF-j3. 

As employed herein , the phrase "activin- or 
TGF-£-like compounds" includes substances which have a 
substantial degree of homology (at least 20% homology) with 

10 the amino acid sequences of naturally occurring mammalian 
inhibin alpha and j3 A or (3 B chains {either singly or in any 
combination) as well as alleles, fragments, homologs or 
derivatives thereof which have substantially the same 
qualitative biological activity as mammalian inhibin, 

15 activin, or TGF-jS. Examples of activin- or TGF-/J-like 
compounds include activin A (a homodimer of two inhibin j3 A 
subunits) , activin B (a homodimer of two inhibin )8 B 
subunits) , activin AB (a heterodimer composed of one 
inhibin j8 A subunit and one inhibin j3 B subunit) , inhibin A 

20 (composed of the inhibin a subunit and an inhibin j8 A 
subunit) , inhibin B (composed of the inhibin a subunit and 
an inhibin jfi B subunit) , TGF-jSl (a homodimer of two TGF-01 
subunits) , TGF-J82 (a homodimer of two TGF-/32 subunits) , 
TGF-j33 (a homodimer of two TGF-£3 subunits) , TGF-04 (a 

25 homodimer of two TGF-/34 subunits) , TGF-/35 (a homodimer of 
two TGF-j85 subunits) , TGF-j8l*2 (a heterodimer of one TGF-j8l 
subunit and one TGF-£2 subunit), and the like. 

Examples of compounds which bear no particular 
30 structural or biological relatedness to activin or TGF-jS, 
but which are contemplated for screening in accordance with 
the bioassays of the present invention, include any 
compound that is capable of either blocking the action of 
the invention receptor peptides, or promoting the action of 
35 the invention receptor peptides, such as, for example, 
alkaloids and other heterocyclic organic compounds, and the 
like . 
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The method employed for cloning the receptor (s) 
of the present Invention involves expressing, in mammalian 
cells, a cDNA library of any cell type thought to respond 
to members of the activin/TGF-/J superfamily of polypeptide 
5 growth factors (e.g., pituitary cells, placental cells, 
fibroblast cells, and the like) . Then, the ability of the 
resulting mammalian cells to bind a labeled receptor ligand 
(i.e., a labeled member of the activin/TGF-0 superfamily of 
polypeptide growth factors) is determined. Finally, the 
10 desired cDNA insert (s) are recovered, based on the ability 
of that cDNA, when expressed in mammalian cells, to induce 
(or enhance) the binding of labeled receptor ligand to said 
cell . 

15 In addition to the above-described applications 

of the receptor proteins and DNA sequences of the present 
invention, the receptor or receptor-encoding compositions 
of the invention can be used in a variety of ways. For 
example, since activin is involved in many biological 

20 processes, the activin receptor (or antibodies thereto) can 
be applied to the modulation of such biological processes . 
For example, the stimulation of FSH release by activin can 
either be enhanced (for example, by supplying the subject 
with increased amounts of the activin receptor, relative to 

25 the amount of endogenous receptor, e.g., by transfecting 
the subject with a tissue specific activin-encoding 
construct), or depressed (e.g., by administration to a 
subject of antibodies to the activin receptor, thereby 
preventing formation of activin-receptor complex, which 

30 would then act to stimulate the release of FSH) . Thus, the 
compositions of the present invention can be applied to the 
control of fertility in humans, domesticated animals, and 
animals of commercial interest. 

35 As another example, the effect of activin on 

mitosis of red and white blood cells can be modulated, for 
example, by administering to a subject (employing suitable 
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means of administration) a modulating, effective amount of 
activin receptor (which would enhance the ability of 
activin present in the cell to modulate mitosis) . 
Alternatively, one could administer to a subject an 
5 antibody to the activin receptor (or a portion thereof) , 
which would reduce the effect of activin by blocking the 
normal interaction between activin and activin receptor. 

As additional examples of the wide utility of the 
10 invention compositions, receptors and/or antibodies of the 
invention can be used in such areas as the diagnosis and/or 
treatment of activin -dependent tumors, enhancing the 
survival of brain neurons, inducing abortion in livestock 
and other domesticated animals, inducing twinning in 
15 livestock and other domesticated animals, and so on. 

As still further examples of the wide utility of 
the invention compositions, agonists identified for TGF-/3 
specific receptors can be used to stimulate wound healing, 

20 to suppress the growth of TGF-jS- sensitive tumors, to 
suppress immune response (and thereby prevent rejection of 
transplanted organs), and the like. Antagonists or the 
soluble, ligand- binding domain derived from TGF-jff receptors 
can be used to block endogenous TGF-jS, thereby promoting 

25 liver regeneration and stimulating some immune responses. 

It can be readily seen, therefore, that the 
invention compositions have utility in a wide variety of 
diagnostic, clinical, veterinary and research applications. 

30 

The invention will now be described in greater 
detail by reference to the following non-limiting examples. 
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EXAMPLES 

Recombinant human (rh) activin A, rh activin B, 
and rh inhibin A were generously provided by Genentech, 
5 Inc. Porcine TGF-/31 was obtained from R+D Systems. 

Double -stranded DNA was sequenced by the dideoxy 
chain termination method using the Sequenase reagents from 
US Biochemicals . Comparison of DNA sequences to databases 
10 was performed using the PASTA program [Pearson and Lipman, 
Proc. Natl. Acad. Sci. USA 85.: 2444-2448 (1988)]. 

EXAMPLE I 

Construction and Subdivision of AtT20 cDNA Library 

15 

Polyadenylated RNA was prepared from AtT20 cells 
using the Fast Track reagents from InVitrogen. cDNA was 
commercially synthesized and ligated into the plasmid 
vector pcDNAl using non-palindromic BstXI linkers, yielding 
20 a library of approximately 5xl0 6 primary recombinants. The 
unamplif ied cDNA library was plated at 1000 clones per 100 
mm plate, then scraped off the plates, frozen in glycerol 
and stored at -70°. 

25 Activin suppresses adrenocorticotrophic hormone 

(ACTH) secretion by both primary anterior pituitary cell 
cultures [Vale et al . , Nature 321: 776-779 (1986)] and 
AtT20 mouse corticotropic cells. Because AtT20 cells 
possess activin receptors indistinguishable from those on 

3 0 other cell types (based on binding affinity measurements 
with activin A) , these cells were chosen to be the source 
of cDNA for transf ection. A cDNA library of approximately 
5xl0 6 independent clones from AtT20 cells was constructed in 
the mammalian expression vector, pcDNAl, and screened using 

35 an expression cloning approach [Gearing et al . , EMBO J. 8., 
3667-3676 (1989)] based on the ability to detect activin 
binding to single transf ected cells. The library was 
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divided into pools of 1000 clones, DNA was prepared from 
each pool of clones and transiently transfected into COS 
cells, and the cells screened for the capacity to bind 
iodinated activin A. Binding was assessed by performing 
5 the transf ections and binding reactions directly on 
chambered microscope slides, then dipping the slides in 
photographic emulsion and analyzing them under a 
microscope. Cells which had been transfected with an 
activin receptor cDNA, and consequently bound radioactive 

10 activin, were covered with silver grains. DNA from pools 
of clones were analyzed either singly or in groups of 
three. Of 300 pools (approximately 300,000 clones) assayed 
in this manner, one group of three generated two positive 
cells when transfected into COS cells. The positive pool 

15 (#64) was identified by transfecting and analyzing DNA from 
each pool of 1000 singly, and then was further fractionated 
until a single clone (pmActRl) was purified which generated 
>10 4 positive cells after transf ection (see Table 1) . 

20 Table 1 

Purification of the activin receptor clone from 
the AtT2Q library 

Pool Clones /pool Positive cells/slide 

25 62,63,64 3x1000 2 

64 1000 1-3 

64-51 400 4-10 

64-51-R10;64-51-C13 20 25-40 

pmActRl 1 >10 4 

30 

The total number of transfected cells capable of binding 125 I 
activin A in a field of 2x10 s COS cells was counted for 
pools of clones at each stage of the purification process. 



35 



pmActRl contained a 1.7 kb insert, coding for a 
protein of 342 amino acids (Figure 3); however, it was 
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incomplete on the 3' end, thus the last 17 amino acids were 

encoded by vector sequences. In order to obtain the entire 
sequence, the AtT20 library was rescreened by hybridization 
with the 1.6 kb SacI-PstI fragment (Figure 3), Screening 
5 6xl0 5 colonies yielded one additional positive clone 
(pmActR2) which had a 2.6 kb insert and contained the 
entire coding sequence for the mouse activin receptor 
(Figure 3) . The nucleic acid sequence and the deduced 
amino acid sequence of the insert in pmActR2 are set forth 
10 in Sequence ID No. 1. 

EXAMPLE II 
COS Cell Transfection 

15 Aliquots of the frozen pools of clones from 

Example I were grown overnight in 3 ml cultures of terrific 
broth, and mini-prep DNA prepared from 1.5 ml using the 
alkaline lysis method [Maniatis et al. Molecular Cloning 
(Cold Spring Harbor Laboratory (1982)]. 1/10 of the DNA 

20 from a mini-prep (10 Ml of 100 Ml) was used for each 
transfection . 

2x10 s COS cells were plated on chambered 
microscope slides (1 chamber - Nunc) that had been coated 

25 with 20 fxg/ml poly-D- lysine and allowed to attach for at 
least 3 hours. Cells were subjected to DEAE-Dextran 
mediated transfection as follows. 1.5 ml of serum-free 
Dulbecco's Modified Eagle's medium (DME) containing 100 mM 
chloroquine was added to the cells, DNA was precipitated 

30 in 200 ml DME/ chloroquine containing 500 mg/ml DEAE- 
Dextran, then added to the cells. The cells were incubated 
at 3 7° for 4 hours, then the media was removed and the 
cells were treated with 10% DMSO in HEPES buffered saline 
for 2 minutes. Fresh media was added and the cells assayed 

3 5 3 days later. For transf ections with the purified clone, 
2.5xl0 6 cells were transf ected in 100 mm dishes with 5 /xg 
purified DNA. The total transfection volume was 10 ml, and 
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the DNA was precipitated in 400 fil . 



EXAMPLE III 
Binding Assay 

5 

Cells were washed 2x with HEPES buffered saline 
(HDB) containing 0.1% BSA, then incubated for 90 minutes at 
22° in 0.5 ml HDB, 0.1% BSA containing 7xl0 5 cpm 125 I activin 
A (approximately 7 ng, 500 pM) . The cells were then washed 

10 3X with cold HDB, fixed for 15 minutes at 22° in 2.5% 
glutaraldehyde/HDB and washed 2X with HDB. The chambers 
were then peeled off the slides, and the slides dehydrated 
in 95% ethanol, dried under vacuum / dipped in NTB2 
photographic emulsion (Kodak) and exposed in the dark at 4° 

15 for 3 days. Following development of the emulsion, the 
slides were dehydrated in 95% ethanol, stained with eosin 
and coverslipped with DPX mountiant (Electron Microscopy 
Sciences) . The slides were analyzed under darkf ield 
illumination using a Leitz microscope. 

20 

EXAMPLE IV 
Subdivision of Positive Pool 

Of 300 pools screened (each pool containing about 
25 1000 cDNAs) , one positive pool (#64) , which produced two 
positive cells, was identified. Bacteria from the frozen 
stock of this positive pool (#64) were replated at 
approximately 400 clones per plate, replica plates were 
made, and DNA was prepared from each subpool and analyzed 
30 employing the binding assay described above. Several 
positive subpools were found, which generated from 4-10 
positive cells per slide. The bacteria from the replica 
plate of one positive subpool were picked onto a grid, and 
DNA prepared from pools of clones representing all the rows 
35 and all the columns, as described by Wong [Science 228 : 810- 
815 (1985)]. The identification of one positive row and 
one positive column unambiguously identified a single 
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clone, which when transfected yielded >10 4 positive 
cells/2xlO s cells. 

EXAMPLE V 

5 Radioreceptor Assay 

10 5 COS cells transfected with either pmActRl or 
pmActR2 / or 10 6 untransf ected COS cells, were plated in 6 
well dishes and allowed to grow overnight. The cells were 

10 washed 2X with HDB, 0.1% BSA, and incubated at 22° for 90 
minutes in 0.5 ml HDB, 0*1% BSA containing 100,000 cpm 
(approximately 1 ng, 75 pM) 125 I activin A (5 pig activin A 
was iodinated by chloramine T oxidation to a specific 
activity of 50-90 ftCi/jig; iodinated activin A was purified 

15 on a 0.7x20 cm G-25 column) and varying amounts of 
unlabeled competitor hormone. Following binding, the cells 
were washed 3X with cold HDB, solubilized in 0.5 ml 0.5 N 
NaOH, removed from the dish and radioactivity was measured 
in a gamma counter. Data presented in Figure 5 are 

20 expressed as % specific binding, where 100% specific 
binding is the difference between binding in the absence of 
competitor and binding in the presence of a 100 fold molar 
excess of unlabeled activin A. Binding parameters were 
determined using the program LIGAND [Munson P.J. and 

25 Rodbard, D. , Anal. Biochem. 107 :220-259 (1980)]. 

EXAMPLE VI 
Chemical Cross -linking 

3 0 2xl0 € COS cells, or 5xl0 6 AtT20 cells, were washed 

2x with HDB, scraped off the dish, incubated for 90 minutes 
at 22° under constant rotation in 0.5 ml HDB containing 
7x1 0 5 cpm (approximately 500 pM) 125 I activin A with or 
without 500 ng (37 rM) unlabeled activin A. Cells were 

35 diluted with 1 ml HDB, pelleted by centrif ugation and 
resuspended in 0.5 ml HDB. Disuccinimidyl suberate (DSS; 
freshly dissolved in DMSO) was added to 500 fiM, and the 
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cells incubated at 0° for 30 minutes. The cross-linking 
was terminated by addition of 1 ml 50 mM Tris-HCl pH 7.5 / 
100 mM NaCl, then the cells were pelleted by 
centrifugation, resuspended in 100 fil 50 mM Tris-HCl pH 
5 7.5, 1% Triton X-100 and incubated at 0° for 60 minutes. 
The samples were centrifuged 5 minutes at 13,000xg, and the 
Triton-soluble supernatants analyzed by SDS-PAGE using 8.5% 
polyacrylamide gels. The gels were dried and subjected to 
autoradiography for 4-14 days. 

10 

EXAMPLE VII 
RNA Blot Analysis 



Total RNA was purified from tissue culture cells 
15 and tissues using LiCl precipitation. 20 (xg total RNA was 
run on 1.2% agarose, 2.2M formaldehyde gels, blotted onto 
nylon membranes (Hybond - NEN) , and hybridized with a 0.6 
kb Kpnl fragment (see Figure 3) which had been labeled with 
32 P by random priming using reagents from US Biochemicals . 
20 Hybridization was performed at 42° in 50% formamide, and 
the filters were washed at 65° in 0.2X SSC. 



EXAMPLE VIII 
Sequence Analysis 

25 

Full length mouse activin receptor clone encodes 
a protein of 513 amino acids, with a 5' untranslated region 
of 70 bp and a 3' untranslated region of 951 bp. pmActR2 
does not contain a poly A tail, although it does have a 

30 potential poladenylylation site at bp 2251. The insert in 
clone pmActRl had an additional 551 bp of 5' untranslated 
sequence, was identical in the overlapping range, and 
stopped at the 3' end at base 1132 of pmActR2. The first 
methionine codon (ATG) , at bp 71, in pmActR2 is in a 

35 favorable context for translation initiation (Kozak, M. , 
Nucl. Acids Res. 15:8125-8148 (1987)], and is preceded by 
an in- frame stop codon. pmActRl contains 3 additional ATGs 
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in the 5' untranslated region; however, none of these is in 
an appropriate context for initiation, and all are followed 
by in-frame stop codons . While this unusually long 5' 
leader sequence may have functional significance, it is 
5 clearly not necessary for proper expression, because 
pmActR2, which lacks most of that sequence, can be 
functionally expressed in COS cells (see below) . 

Hydropathy analysis using the method of Kyte and 

10 Doolittle [J. Mol. Biol. 157:105-132 (1982)] revealed two 
hydrophobic regions: a 10 amino acid stretch at the amino 
terminus assumed to be a single peptide, and a single 
putative 26 residue membrane -spanning region between amino 
acids 119-142 (see Figure 1 and Sequence ID No. 2) . The 

15 signal peptide contains the conserved n-, h- and c- domains 
common to signal sequences; the site of cleavage of the 
signal peptide, before Ala 1 , is predicted based on rules 
described by von Heijne [Biochim. Biophys . Act. 947:307-333 
(1988)]. As is common for the cytoplasmic side of 

20 membrane -spanning domains, the predicted transmembrane 
region is closely followed by two basic amino acids. The 
mature mouse activin receptor is thus predicted to be a 4 94 
amino acid type I membrane protein of Mr 54 kDa, with a 116 
amino acid N- terminal extracellular ligand binding domain, 

25 and a 346 amino acid intracellular signalling domain. 

Comparision of the activin receptor sequence to 
the sequence databases revealed structural similarity in 
the intracellular domain to a number of receptor and non- 
30 receptor kinases. Analysis of the sequences of all kinases 
has led to the identification of a 300 amino acid kinase 
domain characterized by 12 subdomains containing a number 
of highly conserved amino acids [Hanks, S.K. and Quinn, 
A.M., Meth. Enzymol. 200:38-62 (1991) and Hanks et al . , 
35 Science 241:42-52 (1988)]; the activin receptor sequence 
has all of these conserved subdomains in the proper order 
(Figure 4) . A conserved Gly in subdomain I is replaced by 
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Ala 180 in the activin receptor, but this residue has also 
been observed in other kinases. Based upon structural 
relatedness, therefore, this receptor is expected to be a 
functional protein kinase. 

5 

The sequences in two of these subdomains (VIB and 
VIII) can be used to predict tyrosine vs. serine/threonine 
substrate specificity [Hanks et al . , (1988) supra l . The 
sequence of the mouse activin receptor in both of these 
10 subdomains is characteristic of serine kinases. 
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Therefore, the activin receptor is expected to 
have serine/threonine specificity. Furthermore, the 
activin receptor does not have a tyrosine residue in the 
standard autophosphorylation region between subdomains VII 
5 and VIII, indicating that it is not a standard tyrosine 
kinase* The receptor could potentially autophosphorylate 
at Ser 333 or Thr 337 . One interesting additional possibility 
is that the activin receptor kinase may have specificity 
for serine, threonine and tyrosine residues. Several 
10 kinases with these properties have recently been described 
[see, for example, Howell et al., Mol. Cell. Biol. 11:568- 
572 (1991), Stern et al . , Mol. Cell. Biol. 11:987-1001 
(1991) and Featherstond, C. and Russell, P., Nature 
349 :808-811 (1991)] . 

15 

Phylogenetic analysis of the activin receptor 
compared to 161 other kinase sequences revealed that the 
activin receptor and the C.elegans protein, daf-1 [Georgi 
et al., Cell 61:635-645 (1990)] may constitute a separate 

20 subfamily of kinases (see Figure 6) . daf-1 is a putative 
transmembrane receptor involved in the developmental arrest 
of a non-feeding larval state and shares 32% identity with 
the activin receptor (see Figure 6) . Like the activin 
receptor, daf-1 is predicted to be a transmembrane 

25 serine/threonine- specif ic kinase; furthermore, both daf and 
the activin receptor have short, conserved inserts in the 
kinase domain sequence between subdomains VIA-VIB and X-XI 
that are not present in any other kinase (underlined in 
Figure 4B) . This additional similarity lends credence to 

3 0 their belonging to a unique subfamily of kinases. The 
activin receptor is quite distantly related (18% amino acid 
sequence identity) to the only other known transmembrane 
serine/ threonine protein kinase, encloded by the ZmPK gene 
of maize [Walker, J.C. and Zhang, R. , Nature 345:743-746 

35 (1990) ] . 
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The extracellular domain of the activin receptor 
did not show similarity to any other sequences in the 
databases. This ligand binding domain is relatively small 
in comparison to those found in other growth factor 
5 receptors, but like those receptors this domain has a high 
cysteine content. The pattern of these Cys residues, 
however, is not like either an immunoglobulin fold or the 
cysteine rich repeats of the EGF receptor. There are also 
two potential sites of N- linked glycosylation in the 
10 extracellular domain, as well as a number of potential 
phosphorylation sites for protein kinase C and casein 
kinase II in the intracellular domain. 

EXAMPLE IX 

15 Binding Properties of the Cloned Activin Receptor 

To verify that the cloned receptor is activin 
specific, competition binding experiments were performed on 
COS cells transiently transfected with either pmActRl or 

20 pmActR2 . Cells transfected with either construct bound 
activin A with a single high affinity component (Kd = 
180 pM; Figure 5) , indicating that a functional 
(structurally complete) intracellular kinase domain is not 
required for ligand binding. This binding affinity is 

25 consistent with that measured on other activin-responsive 
cell types [see, for example, Campen, C.A. and Vale, W., 
Biochem. Biophys. Res. Comm. 157 : 844-849 (1988); Hino et 
al., J. Biol. Chem. 264 : 10309-10314 (1989); Sugino et al . , 
J. Biol. Chem. 263 : 15249-15252 (1988); and Kondo et al . , 

30 Biochem. Biophys. Res. Coram, 161 : 1267-1272 (1989)]. 
Untransf ected COS cells do not bind activin A. The 
transfected cultures as a whole expressed approximately 
26,000 receptors per cell; however, because only 15% of the 
cells express the transfected gene (as measured by 

35 quantitating transfected cells as a fraction of all cells 
following dipping in emulsion) , each transfected cell 
expressed an average of 175,000 receptors per cell. The 
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level of expression per cell varies considerably, though, 
based on the number of accumulated silver grains. This 
value is comparable to the expression of other transfected 
cell surface proteins in COS cells. 

5 

Binding of iodinated activin A to COS cells 
transiently transfected with pmActR2 could be competed by 
activin B with slightly reduced potency compared to activin 
A; by inhibin A with approximately 10-fold lower potency; 

10 and could not be competed by TGF-/31 (Figure 5B) . This 
affinity and specificity of binding match those observed 
following binding of activin A to a number of other 
activin-responsive cell types. Although activin B appears 
to bind the transfected receptor with lower affinity than 

15 activin A, the activin B preparation used in these 
experiments may have suffered a reduction in potency, based 
on a comparison of bioactivity with activin A, since the 
recombinant synthesis of the activin B employed herein had 
been carried out some time ago [recombinant synthesis of 

20 activin B is described by Mason et al., in Mol . Endocrinol. 
3.: 1352-1358 (1989)]. It is likely that this cDNA encodes 
a receptor for multiple forms of activin. 

The size of the cloned activin receptor was 
25 analyzed by affinity cross-linking 125 I activin A to COS 
cells transfected with pmActR2 using the bifunctional 
chemical cross-linker, disuccinimidyl suberate (DSS) . A 
major cross -linked band of 84 kDa was observed in 
transfected, but not in untransf ected cells. Subtracting 
3 0 the molecular weight of activin, this represents a protein 
of 56 kDa, which corresponds well to the molecular weight 
predicted from the nucleic acid sequence data. Cross - 
linking 125 I activin A to AtT20 cells yields a major band of 
65 kDa, with minor bands of approximately 78 and 84 kDa. 
3 5 The size of the largest band matches that generated by the 
cloned receptor. The smaller bands could be either 
separate proteins, different phosphorylated forms of the 
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same protein, or degradation products of the full length 
clone; the sequences DKKRR at amino acid 35 and KKKR at 
amino acid 416 could be potential sites of proteolysis. 
Alternatively, these bands could come from alternatively 
5 spliced products of the same gene. 

The 84 and 65 kDa cross-linked bands have also 
been observed in other activin- responsive cell types [Hino, 
supra ; Centrella et al . , Mol . Cell. Biol. 11 : 250-258 

10 (1991)3, and interpreted to represent the signalling 
receptor, although complexes of other sizes have also been 
seen as well. The size of the activin receptor is very 
similar to a putative TGF-jS receptor, to the limited extent 
it has been characterized by chemical cross-linking [see 

15 Massague et al . , Ann. N.Y. Acad. Sci. 593 : 59-72 (1990)]. 

EXAMPLE X 
Expression of Activin Receptor mRNA 

20 The distribution of activin receptor mRNA was 

analyzed by Northern blot. Two mRNA species, of 6.0 and 
3.0 kb, were observed in AtT20 cells as well as a number of 
mouse tissues, including brain, testis, pancreas, liver and 
kidney. The total combined size of the inserts from 

25 pmActRl and pmActR2 is 3.1 kb, which corresponds to the 
size of the smaller transcript. Neither the extent of 
similarity between the two mRNAs, nor the significance of 
having two transcripts is clear. The genes for several 
other hormone receptors have been shown to be alternatively 

30 spliced to generate both a cell surface receptor and a 
soluble binding protein, and it is possible that the 
activin receptor is processed in a similar manner. 

Interestingly, the relative abundance of the two 
35 transcripts varies depending on the source. While AtT20 
cells have approximately equal levels of both mRNAs, most 
tissues had much greater levels of the 6.0 kb transcript, 
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with little or no expression of the 3.0 kb transcript. 
Testis, on the other hand, had a greater amount of the 3.0 
kb band. Expression of activin receptor mRNA in brain, 
liver and testis is in accord with described biological 
5 actions of activin in those tissues [Mine et al . , 
Endocrinol. 125 : 586-591 (1989); Vale et al . , Peptide Growth 
Factors and Their Receptors, Handbook of Experimental 
Pharmacology, M.A. Sporn and A*B + Roberts, ed. , 
Springer-Verlag (1990) , in press] . 

10 

EXAMPLE XI 

Identification of a Human Activin Receptor 

A human testis library (purchased from Clontech; 
15 catalog no, HLlOlOb) was probed with the full length mouse 
activin receptor gene (see Sequence ID No. 1) under the 
following conditions: 

Hybridization stringency: 

20% formamide, 6X SSC at 42°C; 
20 Wash stringency: 

2X SSC, 0.1% SDS at 42 °C. 

A sequence which is highly homologous with the 
mouse activin receptor was identified (Sequence ID No. 1') . 
25 Due to the high degree of homology between this receptor 
and the mouse activin receptor, this receptor is designated 
as the human form of the activin receptor from the same 
subclass as the mouse receptor described above . 

30 EXAMPLE XII 

Identification of a Xenopus Activin Receptor 

A Xenopus stage 17 embryo cDNA library (prepared 
as described by Kintner and Melton in Development .99: 311- 
35 325 (1987) was probed with the full length mouse activin 
receptor gene (see Sequence ID No. 1) under the following 
conditions : 
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Hybridization stringency: 

20% formamide, 6X SSC at 42 °C; 
Wash stringency: 

2X SSC, 0.1% SDS at 42°C. 

5 

A sequence having a substantial degree of 
homology with respect to the mouse activin receptor was 
identified (Sequence ID No. 3) . The degree of overall 
amino acid homology (relative to the mouse acitvin 

10 receptor) is only about 69% (with 77% homology in the 
intracellular domain and 58% homology in the extracellular 
domain) . Due to the moderate degree of homology between 
this receptor and the mouse activin receptor, this receptor 
is designated as an activin receptor from a different 

15 subclass than the mouse receptor described above. 

EXAMPLE XIII 
Functional Assays of ActRs in Xenopus embryos 

20 To determine whether xActRIIB can transmit a 

signal in response to activin, xActRIIB RNA was synthesized 
in vitro and injected into Xenopus embryos at two different 
concentrations- Injected embryos were allowed to develop 
to stage 9, at which time animal caps were dissected and 

25 treated overnight with different concentrations of activin. 
The xActRIIB cDNA was cloned into rp64T [see Krieg and 
Melton in Methods in Enzymology, Abelson and Simon, Eds. 
(Academic Press, New York, 1987), vol. 155, p. 397] and 
transcribed in vitro to generate a capped, synthetic 

30 xActRIIB RNA [see Melton et al . , in Nucleic Acids Res. 
12:7035 (1984) and Kintner in Neuron 1:545 (1988)]. 
Embryos at the two- to four-cell stage were injected with 
about 20 nl of RNA at concentrations of 0.02 ng/nl, or 0.1 
ng/nl, spread between four quadrants of the animal pole. 

35 At stage 9, animal caps were removed from RNA- injected 
embryos and incubated in 0.5x modified mammalian Ringer's 
(MMR) , 0.1% bovine serum albumin (BSA) with different 
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concentrations of purified, porcine activin A (six caps per 
incubation) . After 20 hours in culture, total RNA was 
prepared . 

5 The response of the caps to activin was assessed 

by quantifying muscle-specif ic actin RNA with a 
ribonuclease protection assay as per Blackwell and 
Weintraub, Science 250:1104 (1990) . Embryos injected with 
0.4 and 2.0 ng of xActRIIB RNA were approximately 10- and 
10 100 -fold more sensitive, respectively, to activin than 
control embryos. The low amount of muscle actin found in 
animal caps in the absence of added activin A is probably 
a consequence of contamination of the animal cap with a 
small amount of marginal zone tissue. 

15 

The amount of muscle actin decreased with 
increasing concentration of activin in the embryos injected 
with 2 ng of xActRIIB RNA. This is consistent with the 
observation that isolated animal cap cells uniformly 

20 exposed to different concentrations of activin only form 
muscle cells in response to a narrow range of activin 
concentrations [see Blackmann and Kadesch in Genes and 
Development 5.: 1057 (1990)]. The present results indicate 
that the concentration of ligand and the amount of receptor 

25 are both important in determining the signal transmitted. 
Thus, the range of activin concentrations that lead to 
muscle differentiation is lower in animal cap cells from 
injected embryos, which are expressing more receptor than 
normal, than from uninjected embryos. 

30 

EXAMPLE XIV 
Analysis of kinase activity of mActRII 

A fragment of cDNA corresponding to the entire 
3 5 intracellular domain of mActRII ( amino acids 143-494) was 
subcloned into the vector pGEX-2T [see Smith and Johnson in 
Gene 6J7:31-40 (1988)3 , creating a fusion protein between 
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glutathione S- transferase (GST) and the putative kinase 
domain of the receptor. This plasmid was introduced into 
bacteria and the expressed fusion protein was purified 
using glutathione affinity chromatography as described by 
5 Smith and Johnson. Approximately 100-200 ng of fusion 
protein, or of purified GST, were incubated with 25 /zCi 
ty- 32 P] ATPin a buffer containing 50 mM Tris, 10 mM MgCl 2 for 
30 minutes at 37°C. The products were analyzed by SDS-PAGE 
and autoradiography. The fusion protein, but not the GST 
10 alone, became phosphorylated, indicating that the kinase 
domain of the fusion protein was functional. Phosphoamino 
acid analysis, performed according to Cooper et al. [Meth. 
Enzym. .99:387 (1983)], indicated that the predominant amino 
acid residue that became phosphorylated was threonine. 

15 

EXAMPLE XV 
Identification of a Rat A ctivin Receptor 

Degenerate primers deduced from the conserved 
20 serine/ threonine kinase domains of activin/TGF/3 type II 
receptors were used to perform reverse-transcription 
polymerase chain reaction (RT-PCR) on a rat cDNA library 
derived from adult rat pituitary or brain. A mixture of 
oligo (dT) -primed cDNAs from 5 fig of total RNA were used as 
25 templates for PCR. The degenerate primers used were: 

HI: 5 ' -CGGGATCCGTNGCNGTNAARATHTTYCC-3 ' (SEQ ID NO:13) 
(a sense primer corresponding to amino acid sequence 216- 
221 of SEQ ID NO:l in kinase subdomain II); and 
30 H3: 5' -CGGGATCCYTCNGGNGCCATRTANCKYCTNGTNCC-3 ' (SEQ ID 

NO: 14) (an antisense primer corresponding to amino acid 
sequence 361-369 of SEQ ID N0:1 in the kinase subdomain 
VIII) . 

35 The primers have BairiHI sites at the 5' termini to 
facilitate the subcloning of the resulting PCR products. 
The PCR reaction included an initial denaturation step at 
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94°C for 5 min, 35 cycles of 94°C for 1 min, 46°C for 2 rain, 
and 72°C for 3 min, and a final incubation for 10 min at 
72°C. The PCR products were purified and subcloned into the 
pBluescript vector (Stratagene, La Jolla, CA) and 
5 sequenced . 

Four fragments having serine/threonine kinase motifs 
were isolated. Among them, three were previously 
characterized as ActRI (ALK2) , ActRIB (ALK4) and TSRI 
10 (ALK1) . A full length cDNA of a fourth novel clone from an 
adult rat brain cDNA library was isolated, and tentatively 
named ALK7 (activin receptor- like kinase 7) . The 
nucleotide and amino acid sequences for ALK7 are set forth 
in SEQ ID N0s:ll and 12, 

15 

The kinase domain of ALK7 shows highest sequence 
similarity to that of ActRIB and TGF/?RI (82.5% identities 
with them), and the entire amino acid sequence shows 64.0% 
identity to that of TGFjSRI, and 62,1% identity to that of 

20 ActRIB. Furthermore, ALK7 has a "GS domain" almost 
identical to TGFjSRI and ActRIB, and contains cystein 
residues in the extracellular ligand binding domain 
conserved among the receptor serine kinase superfamily. 
This indicates that ALK7 may function as a type I receptor 

25 for the TGF-j3 superfamily. 

RNase protection assays using RNAs isolated from 
various rat brain, kidney, stomach, spleen, heart, skin, 
skeletal muscle, ovary and testis were conducted to 
30 determine the expression patterns of the ALK7 gene. 
Although ALK7 mRNA is not expressed at a high level in 
adult tissues, it is clearly detectable in brain and to a 
lesser extent in kidney and ovary. 

35 Functional characterization of ALK7 or an ALK7 mutant 

ALK7 (T194D) ) was performed in the mink lung cell-line 
"RIB", Chinese Hamster Ovary cell-line (CHO) , and human 
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myelogenous leukemia cell (K562) . These cells were 
transf ected with ALK7 or an ALK7 (T194D) along with the 
transcriptional reporter construct (3TP-Lux) . The mutant 
(ALK7 (T194D) ) has an aspartate residue at position 194 in 
5 the "GS domain" instead of threonine. The plasmid p3TP- 
Lux, which contains three copies of a TPA- responsive 
element and the promoter of the human plasminogen activator 
inhibitor-1 (PA1-1) linked to the lucif erase reporter gene, 
has been shown to be responsive to TGF/3 or activin (see, 

10 e.g, Carcamo et al., 1994, Molec. Cell Biol. , 14 : 3810~ 
3821) . After 24 hours of transf ection, cells were cultured 
in medium containing 0,2-0.5% serum with or without ligands 
for 12-24 hours, and the lucif erase avtivity of cell 
lysates was measured. Although the physiological ligand 

15 that activates ALK7 has yet to be determined, ALK7 (T194D) 
activates the transcriptional response at a level 
approximately 3-4 fold higher than the wild type protein, 
indicating that the mutant is constitutively active. 

20 While the invention has been described in detail 

with reference to certain preferred embodiments thereof, it 
will be understood that modifications and variations are 
within the spirit and scope of that which is described and 
claimed. 



25 
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SUMMARY OF SEQUENCES 

Sequence ID No. 1 is the nucleic acid sequence 
(and the deduced amino acid sequence) of a cDNA encoding a 
5 mouse-derived activin receptor of the present invention. 

Sequence ID No. 1' is a nucleic acid sequence 
encoding a human-derived activin receptor of the present 
invention. Sequence ID No. 1' is substantially the same as 

10 Sequence ID No. 1, except that the codon for amino acid 
residue number 39 encodes lysine (i.e., nucleotides 185-187 
are AAA or AAG) , the codon for amino acid residue 92 
encodes valine (i.e., nucleotides 344-346 are GTN, wherein 
N is A, C, G or T) , and the codon for amino acid residue 

15 number 288 encodes glutamine (i.e., nucleotides 932-934 are 
CAA or CAG) . 

Sequence ID No. 2 is the deduced amino acid 
sequence of a mouse-derived activin receptor of the present 
20 invention. 

Sequence ID No. 2' is an amino acid sequence for 
a human-derived activin receptor of the present invention. 
Sequence ID No, 2' is substantially the same as Sequence ID 
25 No. 2, except that amino acid residue number 39 is lysine, 
amino acid residue 92 is valine, and amino acid residue 
number 288 is glutamine. 

Sequence ID No. 3 is the nucleic acid sequence 
3 0 (and the deduced amino acid sequence) of a cDNA encoding a 
Xenopus -derived activin receptor of the present invention. 

Sequence ID No. 4 is the deduced amino acid 
sequence of a Xenopus -derived activin receptor of the 
35 present invention. 
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Sequence ID No. 5 is the amino acid sequence of 
the VIB subdomain of the serine kinase consensus sequence. 

Sequence ID No. 6 is the amino acid sequence of 
5 the VIII subdomain of the serine kinase consensus sequence. 

Sequence ID No. 7 is the amino acid sequence of 
the VIB subdomain of the invention activin receptor. 

10 Sequence ID No. 8 is the amino acid sequence of 

the VIII subdomain of the invention activin receptor. 

Sequence ID No. 9 is the amino acid sequence of 
the VIB subdomain of the tyrosine kinase consensus 
15 sequence. 

Sequence ID No. 10 is the amino acid sequence of 
the VIII subdomain of the tyrosine kinase consensus 
sequence . 

20 

Sequence ID No. 11 is the nucleic acid sequence 
(and the deduced amino acid sequence) of a cDNA encoding 
rat-derived activin receptor of the present invention. 

25 Sequence ID No. 12 is the deduced amino acid 

sequence of a rat -derived activin receptor of the present 
invention. 

Sequence ID No. 13 is the HI degenerate primer 
3 0 employed in Example XV. 

Sequence ID No, 14 is the H3 degenerate primer 
employed in Example XV. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: Mathews, Lawrence S. 

Vale, Wylie W. 
Tsuchida, Kunihiro 

<ii) TITLE OF INVENTION: CLONING AND RECOMBINANT PRODUCTION OF 
RECEPTOR (S) OF THE ACTI VIN/TGF- BETA SUPERFAMILY 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pretty, Schroeder, Brueggemann & Clark 

(B) STREET: 444 South Flower Street, Suite 2000 

(C) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP : 90071 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/300,584 

(B) FILING DATE: 02 -SEP- 1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/880,220 

(B) FILING DATE: 08 -MAY- 1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/773,229 

(B) FILING DATE: 09-OCT-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/698,709 

(B) FILING DATE: 10-MAY-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Reiter, Stephen E. 

(B) REGISTRATION NUMBER: 31,192 

(C) REFERENCE/DOCKET NUMBER: P41 9927 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-546-4737 

(B) TELEFAX: 619-546-9392 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIjOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 71.. 1609 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:l: 

CTCCGAGGAA GACCCAGGGA ACTGGATATC TAGCGAGAAC TTCCTACGGC TTCTCCGGCG 60 

CCTCGGGAAA ATG GGA GCT GCT GCA AAG TTG GCG TTC GCC GTC TTT CTT 109 
Met Gly Ala Ala Ala Lys Leu Ala Phe Ala Val Phe Leu 
15 10 

ATC TCT TGC TCT TCA GGT GCT ATA CTT GGC AGA TCA GAA ACT CAG GAG 157 
lie Ser Cys Ser Ser Gly Ala lie Leu Gly Arg Ser Glu Thr Gin Glu 
15 20 25 

TGT CTT TTC TTT AAT GCT AAT TGG GAA AGA GAC AGA ACC AAC CAG ACT 205 
Cys Leu Phe Phe Asn Ala Asn Trp Glu Arg Asp Arg Thr Asn Gin Thr 
30 35 40 45 

GGT GTT GAA CCT TGC TAT GGT GAT AAA GAT AAA CGG CGA CAT TGT TTT 253 
Gly Val Glu Pro Cys Tyr Gly Asp Lys Asp Lys Arg Arg His Cys Phe 
50 55 60 

GCT ACC TGG AAG AAT ATT TCT GGT TCC ATT GAA ATA GTG AAG CAA GGT 301 
Ala Thr Trp Lys Asn lie Ser Gly Ser lie Glu lie Val Lys Gin Gly 
65 70 75 

TGT TGG CTG GAT GAT ATC AAC TGC TAT GAC AGG ACT GAT TGT ATA GAA 34 9 

Cys Trp Leu Asp Asp lie Asn Cys Tyr Asp Arg Thr Asp Cys lie Glu 
80 85 90 

AAA AAA GAC AGC CCT GAA GTG TAC TTT TGT TGC TGT GAG GGC AAT ATG 397 
Lys Lys Asp Ser Pro Glu Val Tyr Phe Cys Cys Cys Glu Gly Asn Met 
95 100 105 

TGT AAT GAA AAG TTC TCT TAT TTT CCG GAG ATG GAA GTC ACA CAG CCC 445 
Cys Asn Glu Lys Phe Ser Tyr Phe Pro Glu Met Glu Val Thr Gin Pro 
HO 115 120 125 

ACT TCA AAT CCT GTT ACA CCG AAG CCA CCC TAT TAC AAC ATT CTG CTG 493 
Thr Ser Asn Pro Val Thr Pro Lys Pro Pro Tyr Tyr Asn He Leu Leu 
130 135 140 

TAT TCC TTG GTA CCA CTA ATG TTA ATT GCA GGA ATT GTC ATT TGT GCA 541 
Tyr Ser Leu Val Pro Leu Met Leu He Ala Gly He Val He Cys Ala 
145 150 155 

TTT TGG GTG TAC AGA CAT CAC AAG ATG GCC TAC CCT CCT GTA CTT GTT 58 9 

Phe Trp Val Tyr Arg His His Lys Met Ala Tyr Pro Pro Val Leu Val 
160 165 170 

CCT ACT CAA GAC CCA GGA CCA CCC CCA CCT TCC CCA TTA CTA GGG TTG 63 7 

Pro Thr Gin Asp Pro Gly Pro Pro Pro Pro Ser Pro Leu Leu Gly Leu 
175 180 185 

AAG CCA TTG CAG CTG TTA GAA GTG AAA GCA AGG GGA AGA TTT GGT TGT 685 
Lys Pro Leu Gin Leu Leu Glu Val Lys Ala Arg Gly Arg Phe Gly Cys 
190 195 200 205 



GTC TGG AAA GCC CAG TTG CTC AAT GAA TAT GTG GCT GTC AAA ATA TTT 
Val Trp Lys Ala Gin Leu Leu Asn Glu Tyr Val Ala Val Lys He Phe 
210 215 220 



733 
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CCA ATA CAG GAC AAA CAG TCC TGG CAG AAT GAA TAT GAA GTC TAT AGT 781 
Pro lie Gin Asp Lys Gin Ser Trp Gin Asn Glu Tyr Glu Val Tyr Ser 
225 230 235 

CTA CCT GGA ATG AAG CAT GAG AAC ATA CTA CAG TTC ATT GGT GCA GAG 82 9 

Leu Pro Gly Met Lys His Glu Asn lie Leu Gin Phe lie Gly Ala Glu 
240 245 250 

AAA AGA GGC ACC AGT GTG GAT GTG GAC CTG TGG CTA ATC ACA GCA TTT 877 
Lys Arg Gly Thr Ser Val Asp Val Asp Leu Trp Leu lie Thr Ala Phe 
255 260 265 

CAT GAA AAG GGC TCA CTG TCA GAC TTT CTT AAG GCT AAT GTG GTC TCT 92 5 

His Glu Lys Gly Ser Leu Ser Asp Phe Leu Lys Ala Asn Val Val Ser 
270 275 280 285 

TGG AAT GAA CTT TGT CAT ATT GCA GAA ACC ATG GCT AGA GGA TTG GCA 973 
Trp Asn Glu Leu Cys His lie Ala Glu Thr Met Ala Arg Gly Leu Ala 
290 295 300 

TAT TTA CAT GAG GAT ATA CCT GGC TTA AAA GAT GGC CAC AAG CCT GCA 1021 
Tyr Leu His Glu Asp lie Pro Gly Leu Lys Asp Gly His Lys Pro Ala 
305 310 315 

ATC TCT CAC AGG GAC ATC AAA AGT AAA AAT GTG CTG TTG AAA AAC AAT 1069 
lie Ser His Arg Asp lie Lys Ser Lys Asn Val Leu Leu Lys Asn Asn 
320 325 330 

CTG ACA GCT TGC ATT GCT GAC TTT GGG TTG GCC TTA AAG TTC GAG GCT 1117 
Leu Thr Ala Cys lie Ala Asp Phe Gly Leu Ala Leu Lys Phe Glu Ala 
335 340 345 

GGC AAG TCT GCA GGT GAC ACC CAT GGG CAG GTT GGT ACC CGG AGG TAT 116 S 

Gly Lys Ser Ala Gly Asp Thr His Gly Gin Val Gly Thr Arg Arg Tyr 
350 355 360 365 

ATG GCT CCA GAG GTG TTG GAG GGT GCT ATA AAC TTC CAA AGG GAC GCA 1213 
Met Ala Pro Glu Val Leu Glu Gly Ala lie Asn Phe Gin Arg Asp Ala 
370 375 380 

TTT CTG AGG ATA GAT ATG TAC GCC ATG GGA TTA GTC CTA TGG GAA TTG 1261 
Phe Leu Arg lie Asp Met Tyr Ala Met Gly Leu Val Leu Trp Glu Leu 
385 390 395 

GCT TCT CGT TGC ACT GCT GCA GAT GGA CCC GTA GAT GAG TAC ATG TTA 1309 
Ala Ser Arg Cys Thr Ala Ala Asp Gly Pro Val Asp Glu Tyr Met Leu 
400 405 410 

CCA TTT GAG GAA GAA ATT GGC CAG CAT CCA TCT CTT GAA GAT ATG CAG 1357 
Pro Phe Glu Glu Glu lie Gly Gin His Pro Ser Leu Glu Asp Met Gin 
415 420 425 

GAA GTT GTT GTG CAT AAA AAA AAG AGG CCT GTT TTA AGA GAT TAT TGG 1405 
Glu Val Val Val His Lys Lys Lys Arg Pro Val Leu Arg Asp Tyr Trp 
430 435 440 445 

CAG AAA CAT GCA GGA ATG GCA ATG CTC TGT GAA ACG ATA GAA GAA TGT 1453 
Gin Lys His Ala Gly Met Ala Met Leu Cys Glu Thr lie Glu Glu Cys 
450 455 460 

TGG GAT CAT GAT GCA GAA GCC AGG TTA TCA GCT GGA TGT GTA GGT GAA 1501 
Trp Asp His Asp Ala Glu Ala Arg Leu Ser Ala Gly Cys Val Gly Glu 
465 470 475 

AGA ATT ACT CAG ATG CAA AGA CTA ACA AAT ATC ATT ACT ACA GAG GAC 1549 
Arg lie Thr Gin Met Gin Arg Leu Thr Asn lie He Thr Thr Glu Asp 
480 485 490 
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ATT GTA ACA GTG GTC ACA ATG GTG ACA AAT GOT GAC TTT CCT CCC AAA 1597 
lie Val Thr Val Val Thr Met Val Thr Asn Val Asp Phe Pro Pro Lys 
495 500 505 

GAA TCT AGT CTA TGATGGTGGC ACCGTCTGTA CACACTGAGG ACTGGGACTC 1649 

Glu Ser Ser Leu 

510 

TGAACTGGAG CTGCTAAGCT AAGGAAAGTG CTTAGTTGAT TTTCTGTGTG AAATGAGTAG 1709 

GATGCCTCCA GGACATGTAC GCAAGCAGCC CCTTGTGGAA AGCATGGATC TGGGAGATGG 1769 

ATCTGGGAAA CTTACTGCAT CGTCTGCAGC ACAGATATGA AGAGGAGTCT AAGGGAAAAG 1829 

CTGCAAACTG TAAAGAACTT CTGAAAATGT ACTCGAAGAA TGTGGCCCTC TCCAAATCAA 1889 

GGATCTTTTG GACCTGGCTA ATCAAGTATT TGCAAAACTG ACATCAGATT TCTTAATGTC 194 9 

TGTCAGAAGA CACTAATTCC TTAAATGAAC TACTGCTATT TTTTTTAAAT GAAAAACTTT 2009 

TCATTTCAGA TTTTAAAAAG GGTAACTTTT TATTGCATTT GCTGTTGTTT CTATAAATGA 2069 

CTATTGTAAT GCCAACATGA CACAGCTTGT GAATGTGTAG TGTGCTGCTG TTCTGTGTAC 2129 

ATAGTCATCA AAGTGGGGTA CAGTAAAGAG GCTTCCAAGC ATTACTTTAA CCTCCCTCAA 2189 

CAAGGTATAC CTCAGTTCCA CGGTTGTTAA ATTATAAAAT TGAAAACACT AACAGAATTT 224 9 

GAATAAATCA GTCCATGTTT TATAACAAGG TTAATTACAA ATTCACTGTG TTATTTAAGA 2309 

AAAAATGGTA AGCTATGCTT AGTGCCAATA GTAAGTGGCT ATTTGTAAAG CAGTGTTTTA 2369 

GCTTTTCTTC TACTGGCTTG TAATTTAGGG AAAACAAGTG CTGTCTTTGA AATGGAAAAG 2429 

AATATGGTGT CACCCTACCC CCCATACTTA TATCAAGGTC CCAAAATATT CTTTTCCATT 2489 

TCAAAGACAG CACTTTGAAA ACCCTAAATT ACAAGCCAGT AGAAGAAAAG CTAAAACACG 2549 

CTTTACAAAT AGCC 2563 

(2) INFORMATION FOR SEQ ID NO:2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Ala Ala Ala Lys Leu Ala Phe Ala Val Phe Leu lie Ser Cys 
1 5 10 15 

Ser Ser Gly Ala lie Leu Gly Arg Ser Glu Thr Gin Glu Cys Leu Phe 
20 25 30 

Phe Asn Ala Asn Trp Glu Arg Asp Arg Thr Asn Gin Thr Gly Val Glu 
35 40 45 

Pro Cys Tyr Gly Asp Lys Asp Lys Arg Arg His Cys Phe Ala Thr Trp 
50 55 60 

Lys Asn lie Ser Gly Ser lie Glu lie Val Lys Gin Gly Cys Trp Leu 
€5 70 75 80 
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Asp Asp lie Asn Cys Tyr Asp Arg Thr Asp Cys lie Glu Lys Lys Asp 
85 90 95 

Ser Pro Glu Val Tyr Phe Cys Cys Cys Glu Gly Asn Met Cys Asn Glu 
100 105 110 

Lys Phe Ser Tyr Phe Pro Glu Met Glu Val Thr Gin Pro Thr Ser Asn 
115 120 125 

Pro Val Thr Pro Lys Pro Pro Tyr Tyr Asn lie Leu Leu Tyr Ser Leu 
130 135 140 

Val Pro Leu Met Leu lie Ala Gly lie Val lie Cys Ala Phe Trp Val 
145 150 155 160 

Tyr Arg His His Lys Met Ala Tyr Pro Pro Val Leu Val Pro Thr Gin 
165 170 175 

Asp Pro Gly Pro Pro Pro Pro Ser Pro Leu Leu Gly Leu Lys Pro Leu 
180 185 190 

Gin Leu Leu Glu Val Lys Ala Arg Gly Arg Phe Gly Cys Val Trp Lys 
195 200 205 

Ala Gin Leu Leu Asn Glu Tyr Val Ala Val Lys lie Phe Pro lie Gin 
210 215 220 

Asp Lys Gin Ser Trp Gin Asn Glu Tyr Glu Val Tyr Ser Leu Pro Gly 
225 230 235 240 

Met Lys His Glu Asn lie Leu Gin Phe lie Gly Ala Glu Lys Arg Gly 
245 250 255 

Thr Ser Val Asp Val Asp Leu Trp Leu lie Thr Ala Phe His Glu Lys 
260 265 270 

Gly Ser Leu Ser Asp Phe Leu Lys Ala Asn Val Val Ser Trp Asn Glu 
275 280 285 

Leu Cys His lie Ala Glu Thr Met Ala Arg Gly Leu Ala Tyr Leu His 
290 295 300 

Glu Asp lie Pro Gly Leu Lys Asp Gly His Lys Pro Ala lie Ser His 
305 310 315 320 

Arg Asp lie Lys Ser Lys Asn Val Leu Leu Lys Asn Asn Leu Thr Ala 
325 330 335 

Cys lie Ala Asp Phe Gly Leu Ala Leu Lys Phe Glu Ala Gly Lys Ser 
340 345 350 

Ala Gly Asp Thr His Gly Gin Val Gly Thr Arg Arg Tyr Met Ala Pro 
355 360 365 

Glu Val Leu Glu Gly Ala lie Asn Phe Gin Arg Asp Ala Phe Leu Arg 
370 375 380 

lie Asp Met Tyr Ala Met Gly Leu Val Leu Trp Glu Leu Ala Ser Arg 
385 390 395 400 

Cys Thr Ala Ala Asp Gly Pro Val Asp Glu Tyr Met Leu Pro Phe Glu 
405 410 415 

Glu Glu lie Gly Gin His Pro Ser Leu Glu Asp Met Gin Glu Val Val 
420 425 430 
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Val His Lys Lys Lys Arg Pro Val Leu Arg Asp Tyr Trp Gin Lys His 
435 440 445 

Ala Gly Met Ala Met Leu Cys Glu Thr lie Glu Glu Cys Trp Asp His 
450 455 460 

Asp Ala Glu Ala Arg Leu Ser Ala Gly Cys Val Gly Glu Arg lie Thr 
465 470 475 480 

Gin Met Gin Arg Leu Thr Asn lie lie Thr Thr Glu Asp lie Val Thr 
485 490 495 

Val Val Thr Met Val Thr Asn Val Asp Phe Pro Pro Lys Glu Ser Ser 
500 505 510 

Leu 



{2) INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: XACTR 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 468.. 1997 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCGCCCACAC 


AGTGCAGTGA 


ATAATAGCCG 


GTGCGGCCCC 


TCCCCTCTTT 


CCCTGGCAGT 


60 


TGTGTATCTG 


TCACATTGAA 


GTTTGGG CTC 


CTGTGAGTCT 


GAGCCTCCCC 


CTGTGTCTCA 


120 


TGTGAAGCTG 


CTGCTGCAGA 


AGGTGGAGTC 


GTTGCATGAG 


GGTGGGGGGA 


GTCGCTGCTG 


180 


TTTGATCTGC 


CTCTGCTCCC 


CATTCACACT 


CTCATTTCAT 


TCCCACGGAT 


CCACATTACA 


240 


ACTCGCCTTT 


AACCCTTTCC 


CTGGCGGAGC 


CCACGCGTCT 


TTCATCCCTC 


CTGCCGCGGC 


300 


CGCTGAGCGA 


CCAGAGCGCG 


ACATTGTTGC 


GGCGGGGGAT 


TGGGCGACAT 


TGTTGCGAAT 


360 


AATCGGAGCT 


GCTGGGGGGG 


AACTGATACA 


ACGTTGCGAC 


TGTAAAGGAA 


TTAACTCGGC 


420 


CGAATGGGAT 


TTTATCTGTG 


TCGGTGAGAG 


AAGCGGATCC 


CAGGAGC ATG GGG GCG 


476 



Met Gly Ala 
1 

TCT GTA GCG CTG ACT TTT CTA CTT CTT CTT GCA ACT TTC CGC GCA GGC 524 
Ser Val Ala Leu Thr Phe Leu Leu Leu Leu Ala Thr Phe Arg Ala Gly 
5 10 15 

TCA GGA CAC GAT GAA GTG GAG ACA AGA GAG TGC ATC TAT TAC AAT GCC 572 
Ser Gly His Asp Glu Val Glu Thr Arg Glu Cys lie Tyr Tyr Asn Ala 
20 25 30 35 
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AAC TGG GAA CTG GAG AAG ACC AAC CAA AGT GGG GTG GAA AGC TGC GAA 620 
Asn Trp Glu Leu Glu Lys Thr Asn Gin Ser Gly Val Glu Ser Cys Glu 
40 45 50 

GGG GAA AAG GAC AAG CGA CTC CAC TGT TAC GCG TCT TGG AGG AAC AAT 66 8 

Gly Glu Lys Asp Lys Arg Leu His Cys Tyr Ala Ser Trp Arg Asn Asn 
55 60 65 

TCG GGC TTC ATA GAG CTG GTG AAA AAA GGA TGC TGG CTG GAT GAC TTC 716 
Ser Gly Phe lie Glu Leu Val Lys Lys Gly Cys Trp Leu Asp Asp Phe 
70 75 80 

AAC TGT TAT GAC AGA CAG GAA TGT ATT GCC AAG GAA GAA AAC CCC CAA 764 
Asn Cys Tyr Asp Arg Gin Glu Cys lie Ala Lys Glu Glu Asn Pro Gin 
85 90 95 

GTC TTT TTC TGC TGC TGC GAG GGA AAC TAC TGC AAC AAG AAA TTT ACT 812 
Val Phe Phe Cys Cys Cys Glu Gly Asn Tyr Cys Asn Lys Lys Phe Thr 
100 105 110 H5 

CAT TTG CCT GAA GTC GAA ACA TTT GAT CCG AAG CCC CAG CCG TCA GCC 860 
His Leu Pro Glu Val Glu Thr Phe Asp Pro Lys Pro Gin Pro Ser Ala 
120 125 130 

TCC GTA CTG AAC ATT CTG ATC TAT TCC CTG CTT CCA ATT GTT GGT CTT 908 
Ser Val Leu Asn lie Leu He Tyr Ser Leu Leu Pro He Val Gly Leu 
135 140 145 

TCC ATG GCA ATT CTC CTG GCG TTC TGG ATG TAC CGT CAT CGA AAG CCT 956 
Ser Met Ala He Leu Leu Ala Phe Trp Met Tyr Arg His Arg Lys Pro 
150 155 160 

CCC TAC GGG CAT GTA GAG ATC AAT GAG GAC CCC GGT CTG CCC CCT CCA 1004 
Pro Tyr Gly His Val Glu He Asn Glu Asp Pro Gly Leu Pro Pro Pro 
165 170 175 

TCT CCT CTG GTC GGG CTG AAG CCG CTG CAG TTG CTG GAG ATA AAG GCG 1052 
Ser Pro Leu Val Gly Leu Lys Pro Leu Gin Leu Leu Glu He Lys Ala 
180 185 190 195 

CGA GGC CGT TTC GGT TGC GTC TGG AAA GCT CGT CTG CTG AAT GAA TAT 1100 
Arg Gly Arg Phe Gly Cys Val Trp Lys Ala Arg Leu Leu Asn Glu Tyr 
200 205 210 

GTC GCA GTG AAA ATC TTC CCC GTG CAG GAT AAG CAG TCG TGG CAG TGT 1148 
Val Ala Val Lys He Phe Pro Val Gin Asp Lys Gin Ser Trp Gin Cys 
215 220 225 

GAG AAA GAG ATC TTC ACC ACG CCG GGC ATG AAA CAT GAA AAC CTA TTG 1196 
Glu Lys Glu He Phe Thr Thr Pro Gly Met Lys His Glu Asn Leu Leu 
230 235 240 

GAG TTC ATT GCC GCT GAG AAG AGG GGA AGC AAC CTG GAG ATG GAG CTG 1244 
Glu Phe He Ala Ala Glu Lys Arg Gly Ser Asn Leu Glu Met Glu Leu 
245 250 255 

TGG CTC ATC ACT GCA TTT CAT GAT AAG GGT TCT CTG ACG GAC TAC CTG 1292 
Trp Leu He Thr Ala Phe His Asp Lys Gly Ser Leu Thr Asp Tyr Leu 
260 265 270 275 

AAA GGG AAC TTG GTG AGC TGG AAT GAA CTG TGT CAC ATA ACA GAA ACA 134 0 

Lys Gly Asn Leu Val Ser Trp Asn Glu Leu Cys His He Thr Glu Thr 
280 285 290 

ATG GCT CGT GGG CTG GCC TAC TTA CAT GAA GAT GTG CCC CGC TGT AAA 138 8 

Met Ala Arg Gly Leu Ala Tyr Leu His Glu Asp Val Pro Arg Cys Lys 
295 300 305 
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QGT GAA GGG CAC AAA CCT GCA ATC GCT CAC AGA GAT TTT AAA AGT AAG 

Gly Glu Gly His Lys Pro Ala lie Ala His Arg Asp Phe Lys Ser Lys 
310 315 320 



1436 



AAT GTA TTG CTA AGA AAC GAC CTG ACT GCG ATA TTA GCA GAC TTC GGG 1484 
Asn Val Leu Leu Arg Asn Asp Leu Thr Ala lie Leu Ala Asp Phe Gly 
325 330 335 

CTG GCC GTA CGA TTT GAG CCT GGA AAA CCT CCG GGA GAT ACA CAC GGG 1532 
Leu Ala Val Arg Phe Glu Pro Gly Lys Pro Pro Gly Asp Thr His Gly 
340 345 350 355 

CAG GTT GGC ACC AGG AGG TAT ATG GCT CCT GAG GTT CTA GAG GGA GCA 1580 
Gin Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu Glu Gly Ala 
360 365 370 

ATT AAC TTT CAG CGA GAT TCC TTT CTC AGG ATA GAT ATG TAT GCC ATG 1628 
He Asn Phe Gin Arg Asp Ser Phe Leu Arg He Asp Met Tyr Ala Met 
375 380 385 

GGA CTG GTA CTC TGG GAA ATA GTA TCC CGA TGT ACA GCA GCA GAT GGG 1676 
Gly Leu Val Leu Trp Glu He Val Ser Arg Cys Thr Ala Ala Asp Gly 
390 395 400 



CCA GTA GAT GAG TAT CTG CTC CCA TTC GAA GAA GAG ATT GGG CAA CAT 
Pro Val Asp Glu Tyr Leu Leu Pro Phe Glu Glu Glu He Gly Gin His 
405 410 415 



1724 



CCT TCC CTA GAG GAT CTG CAA GAA GTT GTC GTT CAC AAG AAG ATA CGC 1772 
Pro Ser Leu Glu Asp Leu Gin Glu Val Val Val His Lys Lys He Arg 
420 425 430 435 

CCT GTA TTC AAA GAC CAC TGG CTG AAA CAC CCT GGT CTG GCC CAA CTG 1820 
Pro Val Phe Lys Asp His Trp Leu Lys His Pro Gly Leu Ala Gin Leu 
440 445 450 

TGC GTC ACC ATT GAA GAA TGC TGG GAC CAT GAT GCG GAA GCA CGG CTT 1868 
Cys Val Thr He Glu Glu Cys Trp Asp His Asp Ala Glu Ala Arg Leu 
455 460 465 

TCG GCA GGC TGC GTA GAG GAG CGT ATT TCC CAA ATC CGT AAA TCA GTG 1916 
Ser Ala Gly Cys Val Glu Glu Arg He Ser Gin He Arg Lys Ser Val 
470 475 480 

AAC GGC ACT ACC TCG GAC TGC CTT GTA TCC ATT GTT ACA TCT GTC ACC 1964 
Asn Gly Thr Thr Ser Asp Cys Leu Val Ser He Val Thr Ser Val Thr 
485 490 495 

AAT GTG GAC TTG CCG CCC AAA GAG TCC AGT ATC TGAGGTTTCT TTGGTCTTTC 2017 
Asn Val Asp Leu Pro Pro Lys Glu Ser Ser He 
500 505 510 



CAGACTCAGT 


GACTTTTAAA 


AAAAAAACTC 


ACGAATGCAG 


CTGCTATTTT 


ATCTTGACTT 


2077 


TTTAATATTT 


TTTTTCTTGG 


ATTTTACTTG 


GATCGGATCA 


ATTTACCAGC 


ACGTCATTCG 


2137 


AAAGTATTAA 


AAAAAAAAAA 


CAAAACAAAA 


AAGCAAAAAC 


AGACATCTCA 


GCAAGCATTC 


2197 


AGGTGCCGAC 


TTATGAATGC 


CAATAGGTGC 


AGGAACTTCA 


GAACCTCAAC 


AAACTCATTT 


2257 


CTAGAGAATG 


TTCTCCTGGT 


TTCCTTTATC 


T CAG AAG AGG 


ACCCATAGGA 


AAACACCTAA 


2317 



GTCAAGCAAA TGCTGCAG 



2335 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gly Ala Ser Val Ala Leu Thr Phe Leu Leu Leu Leu Ala Thr Phe 
1 5 10 15 

Arg Ala Gly Ser Gly His Asp Glu Val Glu Thr Arg Glu Cys lie Tyr 
20 25 30 

Tyr Asn Ala Asn Trp Glu Leu Glu Lys Thr Asn Gin Ser Gly Val Glu 
35 40 45 

Ser Cys Glu Gly Glu Lys Asp Lys Arg Leu His Cys Tyr Ala Ser Trp 
50 55 60 

Arg Asn Asn Ser Gly Phe lie Glu Leu Val Lys Lys Gly Cys Trp Leu 
65 70 75 80 

Asp Asp Phe Asn Cys Tyr Asp Arg Gin Glu Cys lie Ala Lys Glu Glu 
85 90 95 

Asn Pro Gin Val Phe Phe Cys Cys Cys Glu Gly Asn Tyr Cys Asn Lys 
100 105 110 

Lys Phe Thr His Leu Pro Glu Val Glu Thr Phe Asp Pro Lys Pro Gin 
115 120 125 

Pro Ser Ala Ser Val Leu Asn lie Leu lie Tyr Ser Leu Leu Pro lie 
130 135 140 

Val Gly Leu Ser Met Ala lie Leu Leu Ala Phe Trp Met Tyr Arg His 
145 150 155 160 

Arg Lys Pro Pro Tyr Gly His Val Glu lie Asn Glu Asp Pro Gly Leu 
165 170 175 

Pro Pro Pro Ser Pro Leu Val Gly Leu Lys Pro Leu Gin Leu Leu Glu 
180 185 190 

lie Lys Ala Arg Gly Arg Phe Gly Cys Val Trp Lys Ala Arg Leu Leu 
195 200 205 

Asn Glu Tyr Val Ala Val Lys lie Phe Pro Val Gin Asp Lys Gin Ser 
210 215 220 

Trp Gin Cys Glu Lys Glu lie Phe Thr Thr Pro Gly Met Lys His Glu 
225 230 235 240 

Asn Leu Leu Glu Phe lie Ala Ala Glu Lys Arg Gly Ser Asn Leu Glu 
245 250 255 

Met Glu Leu Trp Leu lie Thr Ala Phe His Asp Lys Gly Ser Leu Thr 
260 265 270 

Asp Tyr Leu Lys Gly Asn Leu Val Ser Trp Asn Glu Leu Cys His He 
275 280 285 

Thr Glu Thr Met Ala Arg Gly Leu Ala Tyr Leu His Glu Asp Val Pro 
290 295 300 
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Arg Cys Lys Gly Glu Gly His Lys Pro Ala lie Ala His Arg Asp Phe 
305 310 315 320 

Lys Ser Lys Asn Val Leu Leu Arg Asn Asp Leu Thr Ala lie Leu Ala 
325 330 335 

Asp Phe Gly Leu Ala Val Arg Phe Glu Pro Gly Lys Pro Pro Gly Asp 
340 345 350 

Thr His Gly Gin Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu 
355 360 365 

Glu Gly Ala He Asn Phe Gin Arg Asp Ser Phe Leu Arg He Asp Met 
370 375 380 

Tyr Ala Met Gly Leu Val Leu Trp Glu He Val Ser Arg Cys Thr Ala 
385 390 395 400 

Ala Asp Gly Pro Val Asp Glu Tyr Leu Leu Pro Phe Glu Glu Glu He 
405 410 415 

Gly Gin His Pro Ser Leu Glu Asp Leu Gin Glu Val Val Val His Lys 
420 425 430 

Lys He Arg Pro Val Phe Lys Asp His Trp Leu Lys His Pro Gly Leu 
435 440 445 

Ala Gin Leu Cys Val Thr He Glu Glu Cys Trp Asp His Asp Ala Glu 
450 455 460 

Ala Arg Leu Ser Ala Gly Cys Val Glu Glu Arg He Ser Gin He Arg 
465 470 475 480 

Lys Ser Val Asn Gly Thr Thr Ser Asp Cys Leu Val Ser He Val Thr 
485 490 495 

Ser Val Thr Asn Val Asp Leu Pro Pro Lys Glu Ser Ser He 
500 505 510 

(2) INFORMATION FOR SEQ ID NO: 5: 

{i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Leu Lys Pro Glu Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION : /note= "Xaa at position 2 is either 
"Thr" or "Ser" . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "Xaa a position 5 is either 
"Tyr" or "Phe" . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Gly Xaa Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp lie Lys Ser Lys Asn 
l 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 6 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Thr Arg Arg Tyr Met 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asp Leu Ala Ala Arg Asn 

i 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa at position 3 is either 
"lie" or "Val" 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa at position 4 is either 
"Lys" or "Arg"." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "Xaa at position 6 is either 
"Thr" or "Met" . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Pro Xaa Xaa Trp Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 72 1553 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCCCGGGAAC TTCAAAGCGC GCTGCGGCGG CGCTCTGGGA CCCCGAAGCC TTGCACCGCC 

GCGGGGTGGC C ATG ACC CCA GCG CGC CGC TCC GCA CTG AGC CTG GCC CTC 
Met Thr Pro Ala Arg Arg Ser Ala Leu Ser Leu Ala Leu 
15 10 
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CTG CTG GTG GCA CTG GCC TCC GAC CTT GCG GCA GGA CTG AAG TGT GTG 158 
Leu Leu Val Ala Leu Ala Ser Asp Leu Ala Ala Gly Leu Lys Cys Val 
15 20 25 

TGT CTT TTG TGT GAT TCC TCA AAC TTT ACC TGC CAA ACC GAA GGA GCA 206 
Cys Leu Leu Cys Asp Ser Ser Asn Phe Thr Cys Gin Thr Glu Gly Ala 
30 35 40 45 

TGC TGG GCC TCT GTC ATG CTA ACC AAC GGG AAA GAA CAG GTG AGC AAA 254 
Cys Trp Ala Ser Val Met Leu Thr Asn Gly Lys Glu Gin Val Ser Lys 
50 55 60 

TCG TGC GTG TCC CTC CCG GAA CTA AAT GCT CAG GTC TTC TGT CAC AGT 3 02 

Ser Cys Val Ser Leu Pro Glu Leu Asn Ala Gin Val Phe Cys His Ser 
65 70 75 

TCC AAC AAC GTG ACC AAG ACC GAA TGT TGC TTC ACA GAC TTC TGC AAC 350 
Ser Asn Asn Val Thr Lys Thr Glu Cys Cys Phe Thr Asp Phe Cys Asn 
80 85 90 

AAC ATC ACT CAG CAC CTT CCC ACA GCA TCT CCA GAT GCC CCT AGA CTT 398 
Asn lie Thr Gin His Leu Pro Thr Ala Ser Pro Asp Ala Pro Arg Leu 
95 100 105 

GGC CCC ACA GAG CTG ACA GTT GTT ATC ACT GTA CCT GTT TGC CTC CTG 44 6 

Gly Pro Thr Glu Leu Thr Val Val lie Thr Val Pro Val Cys Leu Leu 
HO 115 120 125 

TCC ATC GCA GCC ATG CTA ACG ATA TGG GCC TGC CAG GAC CGC CAG TGC 4 94 

Ser He Ala Ala Met Leu Thr He Trp Ala Cys Gin Asp Arg Gin Cys 
130 135 140 

ACA TAC AGG AAG ACC AAG AGA CAC AAT GTG GAG GAA CCA CTG GCA GAG 542 
Thr Tyr Arg Lys Thr Lys Arg His Asn Val Glu Glu Pro Leu Ala Glu 
145 150 155 

TAC AGC CTT GTC AAT GCT GGA AAA ACC CTC AAA GAT CTG ATT TAT GAT 590 
Tyr Ser Leu Val Asn Ala Gly Lys Thr Leu Lys Asp Leu He Tyr Asp 
160 165 170 

GCC ACT GCC TCG GGC TCA GGA TCT GGC CCG CCT CTT TTG GTT CAA AGA 638 
Ala Thr Ala Ser Gly Ser Gly Ser Gly Pro Pro Leu Leu Val Gin Arg 
175 180 185 

ACC ATC GCA AGG ACA ATT GTA CTT CAA GAA ATC GTA GGA AAA GGT CGG 686 
Thr He Ala Arg Thr He Val Leu Gin Glu He Val Gly Lys Gly Arg 
190 195 200 205 

TTT GGG GAA GTG TGG CAC GGA AGA TGG TGT GGA GAA GAT GTG GCT GTG 734 
Phe Gly Glu Val Trp His Gly Arg Trp Cys Gly Glu Asp Val Ala Val 
210 215 220 

AAA ATA TTC TCC TCC AGA GAT GAG AGA TCT TGG TTC CGT GAG GCA GAA 782 
Lys He Phe Ser Ser Arg Asp Glu Arg Ser Trp Phe Arg Glu Ala Glu 
225 230 235 

ATT TAT CAG ACG GTA ATG CTG AGA CAT GAG AAT ATT CTC GGT TTC ATC 83 0 

He Tyr Gin Thr Val Met Leu Arg His Glu Asn He Leu Gly Phe He 
240 245 250 

GCG GCC GAC AAC AAA GAT AAT GGA ACC TGG ACT CAG CTT TGG CTT GTG 8 78 

Ala Ala Asp Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val 
255 260 265 

TCA GAG TAT CAC GAG CAG GGC TCC TTA TAT GAC TAT TTG AAT AGA AAC 
Ser Glu Tyr His Glu Gin Gly Ser Leu Tyr Asp Tyr Leu Asn Arg Asn 
270 275 280 285 
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ATA GTG ACC GTG GCT GGA ATG GTC AAG TTG GCG CTT TCA ATA GCG AGT 974 
He Val Thr Val Ala Gly Met Val Lys Leu Ala Leu Ser He Ala Ser 

290 295 300 

GGT CTG GCT CAC CTA CAC ATG GAG ATC GTG GGC ACT CAA GGT AAG CCT 1022 
Gly Leu Ala His Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro 
305 310 315 

GCT ATT GCT CAC CGA GAT ATA AAG TCA AAG AAT ATC TTA GTC AAA AAG 1070 
Ala He Ala His Arg Asp lie Lys Ser Lys Asn He Leu Val Lys Lys 
320 325 330 

TGT GAC ACT TGT GCC ATA GCT GAC TTA GGG CTG GCT GTG AAA CAT GAT 1118 
Cys Asp Thr Cys Ala He Ala Asp Leu Gly Leu Ala Val Lys His Asp 
335 340 345 

TCT ATC ATG AAC ACT ATA GAT ATA CCC CAG AAT CCT AAA GTG GGA ACC 1166 
Ser He Met Asn Thr He Asp He Pro Gin Asn Pro Lys Val Gly Thr 
350 355 360 365 

AAG AGG TAT ATG GCT CCC GAA ATG CTT GAT GAT ACA ATG AAC GTC AAC 1214 
Lys Arg Tyr Met Ala Pro Glu Met Leu Asp Asp Thr Met Asn Val Asn 
370 375 380 

ATC TTT GAG TCC TTC AAG CGA GCT GAC ATC TAT TCG GTG GGG CTG GTT 1262 
He Phe Glu Ser Phe Lys Arg Ala Asp He Tyr Ser Val Gly Leu Val 
385 390 395 

TAC TGG GAA ATA GCT CGA AGG TGT TCA GTT GGA GGA CTT GTT GAA GAG 1310 
Tyr Trp Glu He Ala Arg Arg Cys Ser Val Gly Gly Leu Val Glu Glu 
400 405 410 

TAC CAG TTG CCT TAT TAT GAC ATG GTG CCT TCA GAT CCT TCC ATA GAG 13 58 

Tyr Gin Leu Pro Tyr Tyr Asp Met Val Pro Ser Asp Pro Ser He Glu 
415 420 425 

GAA ATG AGG AAG GTC GTT TGT GAT CAG AAA CTG CGA CCA AAT CTC CCA 1406 
Glu Met Arg Lys Val Val Cys Asp Gin Lys Leu Arg Pro Asn Leu Pro 
4 30 435 440 445 

AAC CAG TGG CAA AGC TGT GAG GCG CTC CGG GTC ATG GGA AGA ATA ATG 1454 
Asn Gin Trp Gin Ser Cys Glu Ala Leu Arg Val Met Gly Arg He Met 
450 455 460 

CGT GAG TGC TGG TAT GCC AAC GGG GCA GCT CGC CTG ACC GCC CTG CGC 1502 
Arg Glu Cys Trp Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg 
465 470 475 

GTG AAG AAG ACC ATT TCT CAG CTG TGT GTC AAG GAA GAC TGT AAG GCC 1550 
Val Lys Lys Thr He Ser Gin Leu Cys Val Lys Glu Asp Cys Lys Ala 
480 485 490 

TAAGGATACA GGCGACGGGA AAGCCCTCAC CACTCTCTTT CATGTCTCCT GC 1602 
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(2) 



INFORMATION FOR SEQ ID NO: 12: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Thr Pro Ala Arg Arg Ser Ala Leu Ser Leu Ala Leu Leu Leu Val 
15 10 15 

Ala Leu Ala Ser Asp Leu Ala Ala Gly Leu Lys Cys Val Cys Leu Leu 
20 25 30 

Cys Asp Ser Ser Asn Phe Thr Cys Gin Thr Glu Gly Ala Cys Trp Ala 
35 40 45 

Ser Val Met Leu Thr Asn Gly Lys Glu Gin Val Ser Lys Ser Cys Val 
50 55 60 

Ser Leu Pro Glu Leu Asn Ala Gin Val Phe Cys His Ser Ser Asn Asn 
65 70 75 80 

Val Thr Lys Thr Glu Cys Cys Phe Thr Asp Phe Cys Asn Asn lie Thr 
85 90 95 

Gin His Leu Pro Thr Ala Ser Pro Asp Ala Pro Arg Leu Gly Pro Thr 
100 105 110 

Glu Leu Thr Val Val He Thr Val Pro Val Cys Leu Leu Ser He Ala 
115 120 125 

Ala Met Leu Thr He Trp Ala Cys Gin Asp Arg Gin Cys Thr Tyr Arg 
130 135 140 

Lys Thr Lys Arg His Asn Val Glu Glu Pro Leu Ala Glu Tyr Ser Leu 
145 150 155 160 

Val Asn Ala Gly Lys Thr Leu Lys Asp Leu He Tyr Asp Ala Thr Ala 
165 170 175 

Ser Gly Ser Gly Ser Gly Pro Pro Leu Leu Val Gin Arg Thr He Ala 
180 185 190 

Arg Thr He Val Leu Gin Glu He Val Gly Lys Gly Arg Phe Gly Glu 
195 200 205 

Val Trp His Gly Arg Trp Cys Gly Glu Asp Val Ala Val Lys He Phe 
210 215 220 

Ser Ser Arg Asp Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin 
225 230 235 240 

Thr Val Met Leu Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp 
245 250 255 

Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Glu Tyr 
260 265 270 

His Glu Gin Gly Ser Leu Tyr Asp Tyr Leu Asn Arg Asn He Val Thr 
275 280 285 

Val Ala Gly Met Val Lys Leu Ala Leu Ser He Ala Ser Gly Leu Ala 
290 295 300 
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His Leu His Met Glu lie Val Gly Thr Gin Gly Lys Pro Ala He Ala 
305 310 315 320 

His Arg Asp He Lys Ser Lys Asn He Leu Val Lys Lys Cys Asp Thr 
325 330 335 

Cys Ala He Ala Asp Leu Gly Leu Ala Val Lys His Asp Ser He Met 
340 345 350 

Asn Thr He Asp He Pro Gin Asn Pro Lys Val Gly Thr Lys Arg Tyr 
355 360 365 

Met Ala Pro Glu Met Leu Asp Asp Thr Met Asn Val Asn He Phe Glu 
370 375 380 

Ser Phe Lys Arg Ala Asp He Tyr Ser Val Gly Leu Val Tyr Trp Glu 
385 390 395 400 

He Ala Arg Arg Cys Ser Val Gly Gly Leu Val Glu Glu Tyr Gin Leu 
405 410 415 

Pro Tyr Tyr Asp Met Val Pro Ser Asp Pro Ser He Glu Glu Met Arg 
420 425 430 

Lys Val Val Cys Asp Gin Lys Leu Arg Pro Asn Leu Pro Asn Gin Trp 
435 440 445 

Gin Ser Cys Glu Ala Leu Arg Val Met Gly Arg He Met Arg Glu Cys 
450 455 460 

Trp Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Val Lys Lys 
465 470 475 480 

Thr He Ser Gin Leu Cys Val Lys Glu Asp Cys Lys Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGGATCCGT NGCNGTNAAR ATHTTYCC 28 
(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CGGGATCCYT CNGGNGCCAT RTANCKYCTN GTNCC 



35 



